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(54) Titre : GENOME DE LISTERIA MONOCYTOGENES, POLYPEPTIDES ET UTILISATIONS 

(57) Abstract: The invention concerns the genome sequence and nucleotide sequences of Listeria monocytogenes EGD-e. The 
invention also concerns polypeptides of said organism, in particular surface or cell envelope polypeptides, or polypeptides involved 
in the different metabolism cycles, in particular vitamin B 12 biosynthesis. The invention further concerns uses of said sequences, and 
various tools enabling identification of L. monocytogenes or associated species. Finally, the invention concerns modified <I>L</I>. 
monocytogenes strains, and pharmaceutical or vaccine compositions using the inventive sequences. 

(57) Abrege : La presente invention concerne la sequence genomique et des sequences nucleotidiques de Listeria monocytogenes 
EGD-e. L invention a egalement pour objet les polypeptides de cet organisme, en particulier les polypeptides de surface ou d'enve- 
loppe cellulaire, ou impliques dans les differents cycles de metabolisme, en particulier la biosynthese de vitamine B12. L' invention 
concerne aussi les utilisations des sequences decrites, ainsi que differents outils permettant 1' identification de L. monocytogenes ou 
especes associees. L'invention concerne ausi des souches L. monocytogenes modifiees, ainsi que les compositions pharmaceutiques 
ou vaccinales utilisant les sequences de l'invention. 
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GENOME DE Listeria monocytogenes, POLYPEPTIDES ET UTILISATIONS 

L 5 invention a pour objet la sequence genomique et des sequences 
5 nucleotidiques codant pour des polypeptides de Listeria monocytogenes, tels que 
des polypeptides d'enveloppe cellulaire, secretes ou specifiques, ou impliques dans 
le metabolisme, dans le processus de replication ou dans la virulence, ainsi que des 
vecteurs incluant lesdites sequences et cellules ou animaux transformes par ces 
vecteurs. L' invention concerne egalement des procedes de detection de ces acides 

10 nucleiques ou polypeptides et des kits de diagnostic d' infection par Listeria 
monocytogenes. L'invention vise aussi une methode de selection de composes 
capables de moduler P infection bacterienne et un procede de biosynthese ou de 
biodegradation de molecules d'interet utilisant lesdites sequences nucleotidiques ou 
lesdits polypeptides. L'invention comprend enfin des compositions 

15 pharmaceutiques, notamment vaccinales, pour la prevention et/ou le traitement 
d 5 infections bacteriennes, en parti culier par Listeria monocytogenes. 

Listeria monocytogenes est un pathogene intracellulaire facultatif II s'agit 
de T agent etiologique de la listeriose, une infection liee a la nourriture posant des 
problemes de sante publique de plus en plus importants, avec un impact 

20 economique important pour Pindustrie alimentaire europeenne. La listeriose est 
P infection liee aux aliments la plus letale (mortalite d' environ 30 %). Listeria 
monocytogenes possede la propriete inhabituelle d'etre capable de traverser trois 
barrieres : La barriere intestinale, la barriere hemato-encephalique et la barriere 
placentaire. Les manifestations cliniques de la listeriose incluent les meningites, 

25 meningoencephalites, avortements et septicemics. Cette infection est opportuniste 
et affecte principalement les femmes enceintes, les bebes, les personnes agees et les 
personnes immuno-deprimees en particulier les personnes atteintes du SIDA. Cette 
maladie affecte apparemment egalement les individus sains et est responsable d'un 
nombre important d'epidemies en raison de produits alimentaires contamines. 

30 Listeria monocytogenes est egalement d'une importance veterinaire avec un risque 
principal pour les ovins (moutons) et les bovins. Listeria monocytogenes est 
particulierement resistante au stress ou aux conditions extremes et il est important 
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de rechercher sa presence avec soin non seulement pour des problemes de securite 
alimentaire mais egalement pour des problemes de securite environnementale. 

La carte physique et une carte genetique preliminaire du genome de Listeria 
monocytogenes ont ete etablies pour la souche L028. Neanmoins, aucune carte 
5 genetique fine n'est disponible pour Tinstant. Le genome de cette bacterie est 
circulaire et comporte environ 3000 kilobases. Son contenu en GC est d'environ 38 
%. Les etudes des facteurs de virulence ont permis Fidentikcaiiori d'un locus de 15 
kb qui peut etre considere comme etant un ilot de pathogenicity dans la mesure ou il 
contient la plupart des genes dont la fonction dans la virulence a ete clairement 

1 0 identifiee. En plus de ce locus, quelques autres genes ont ete identifies en particulier 
les genes d' invasion et de motilite et des genes qui codent pour une hydrolase 
mureine, une superoxide dismutase, des facteurs sigma, etc... 

Une famille importante de proteines de Listeria monocytogenes est la 
famille des proteines de surface. Les processus d 'evolution ont permis le 

15 developpement d'un nombre de mecanismes uniques sur les bacteries Gram+, par 
lesquels elles peuvent immobiliser des proteines a leur surface. Les fonctions de ces 
differentes proteines de parois cellulaires sont extremement diverses. Pourtant, 
beaucoup de proteines liees de fa<?on covalente a la surface des pathogenes Gram-t- 
sont estimees etre importantes pour la survie du pathogene a l'interieur de Fhote 

20 infecte. Pour Listeria monocytogenes, la capacite a penetrer dans les cellules 
eukaryotes a ete liee a une famille de proteines de surface et/ou secretees, les 
internalines. Jusqu'ici, neuf membres de la famille des internalines ont ete identifies 
(InlA, InlB, InlC, InlC2, InlD, InlE 5 InlF, InlG, InlH). On pense qu' elles sont 
ancrees dans la paroi cellulaire par un clivage proteolytique de la liaison T-G du 

25 motif LPxTG et que la liaison covalente du groupe carboxylique de la threonine a 
un groupe amino libre dans le peptidoglycane. Des etudes recentes ont montre qu'il 
y avait deux classes d' internalines ; un groupe de proteines associees a la paroi 
cellulaire de haut point moleculaire (> 50 kDa) et un groupe de plus petites 
proteines (< 30 kDa) qui sont secretees. Ces deux classes possedent des motifs 

30 similaires LRR tres homologues, ainsi que les regions flanquantes LRR, et la 
sequence peptide-signal N-terminale. Les petites internalines (s-inl) ne possedent 
pas la region repetee B et la sequence d'ancrage dans la paroi cellulaire et sont ainsi 
secretees. 
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L' etude de Listeria monocytogenes demande de nouvelles approches, en 
particulier genetiques, afm d'ameliorer la comprehension ,des .differentes voies 
metaboliques de cet organisme. 

Ainsi, c'est un objet de la presente invention que de divulguer la sequence 
5 complete du genome de Listeria monocytogenes EGD-e deposee a la CNCM le 1 1 
avril 2000 sous le numero 1-2440 ainsi que de tous les genes contenus dans cedit 
genome. 

En effet, la connaissance du genome de cet organisme permet de mieux 
definir les interactions entre les differents genes, les differentes proteines, et par la- 
1 0 meme, les differentes voies metaboliques. En effet, et contrairement a la divulgation 
de sequences isolees, la sequence genomique complete d'un organisme forme un 
tout, permettant d'obtenir immediatement toutes les informations necessaires a cet 
organisme pour croitre et fonctionner. ,* « 

La presente invention concerne done une sequence nucleotidique de Listeria 
1 5 monocytogenes caracterisee en ce qu'elle correspond a la sequence SEQ ID N° 1 . 

La presente invention concerne egalement une sequence nucleotidique de 
Listeria monocytogenes caracterisee en ce qu'elle est choisie parmi : 

a) une sequence nucleotidique comportant au moins 80 %, 85 %, 90 %, 95 
% ou 98 % d'identite avec SEQ ID N° 1 ; 
20 b) une sequence nucleotidique hybridant dans des conditions de forte 

stringence avec SEQ ID N° 1 ; 
c) une sequence nucleotidique complementaire de SEQ ID 
N° 1 ou complementaire d'une sequence nucleotidique telle que definie 
en a), ou b), ou une sequence nucleotidique de 1* ARN correspondant ; 
25 d) une sequence nucleotidique de fragment representatif de SEQ ID N° 1, 

ou de fragment representatif d'une sequence nucleotidique telle que 
defmie en a), b) ou c); 
e) une sequence nucleotidique comprenant une sequence telle que definie 
en a), b) ? c) ou d) ; et 

30 f) une sequence nucleotidique modifiee d'une sequence nucleotidique telle 

que definie en a), b), c), d) ou e). 
De fa^on plus particuliere, la presente invention a egalement pour objet les 
sequences nucleotidiques caracterisees en ce qu'elles sont issues de SEQ ID N° 1 et 
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en ce qu'elles codent pourun polypeptide choisi parmi les polypeptides de sequence 
SEQ ID N° 2 a SEQ ED N° 2854, de preference codant pour un polypeptide 
d'enveloppe cellulaire ou present a la surface de Listeria monocytogenes de 
sequence SEQ ID N° 2 a SEQ ID N° 41 ou codant pour un polypeptide implique 
5 dans la biosynthese de la vitamine B12 de sequence SEQ ID N° 42 a SEQ ID N°64. 

La presente invention concerne aussi de fa?on plus generate les sequences 
nucleotidiques issues de SEQ ID N° 1, et codant pour un polypeptide de L. 
monocytogenes ; telles qu'elles peuvent etre isolees a partir de SEQ ID N° 1. 

De plus, les sequences nucleotidiques caracterisees en ce qu'elles 
1 0 comprennent une sequence nucleotidique choisie parmi : 

a) une sequence nucleotidique codant pour un polypeptide choisi parmi les 
sequences SEQ ID N° 2 a SEQ ID N°2854, de preference choisi parmi les 
polypeptides de sequence SEQ ID N° 2 a SEQ ID N° 41 ou SEQ ID N° 42 a 
SEQ ID N° 64 ; 

15 b) une sequence nucleotidique comportant au moins 80 %, 85 %, 90 %, 95 % ou 
98 % d'identite avec une sequence nucleotidique codant pour un polypeptide 
choisi parmi les sequences SEQ ID N° 2 a SEQ ID N° 2854, de preference 
choisi parmi les polypeptides de sequence SEQ ID N° 2 a SEQ ID N° 41 ou 
SEQ ID N° 42 a SEQ ID N° 64 ; 

20 c) une sequence nucleotidique s'hybridant dans des conditions de forte stringence 
avec une sequence nucleotidique codant pour un polypeptide choisi parmi les 
sequences SEQ ID N° 2 a SEQ ID N° 2854, de preference choisi parmi les 
polypeptides de sequence SEQ ID N° 2 a SEQ ID N° 41 ou SEQ ID N° 42 a 
SEQ ID N° 64 ; 

25 d) une sequence nucleotidique complementaire ou d'AKN correspondant a une 
sequence telle que definie en a), b) ou c) ; 

e) une sequence nucleotidique de fragment representatif d'une sequence telle que 
definie en a), b), c) ou d) ; et 

f) une sequence nucleotidique modifiee d'une sequence telle que definie en a), b), 
30 c), d) ou e), 

sont egalement des objets de l'invention. 

Par acide nucleique, sequence nucleique ou d'acide nucleique, 
polynucleotide, oligonucleotide, sequence de polynucleotide, sequence 
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nucleotidique, termes qui seront employes indifferemment dans la presente 
description, on entend designer un enchainement precis de nucleotides, modifies ou 
non, permettant de definir un fragment ou une region d'un acide nucleique, 
comportant ou non des nucleotides non naturels, et pouvant correspondre aussi bien 
5 a un ADN double brin, un ADN simple brin que des produits de transcription 
desdits ADNs. Ainsi, les sequences nucleiques selon Tinvention englobent 
egalement les PNA (Peptid Nucleic Acid), ou analogues. 

II doit etre compris que la presente invention ne concerne pas les sequences 
nucleotidiques dans leur environnement chromosomique naturel, c'est-a-dire a l'etat 

1 0 naturel II s'agit de sequences qui ont ete isolees et/ou purifiees, c'est-a-dire qu'elles 
ont ete prelevees directement ou indirectement, par exemple par copie, leur 
environnement ayant ete au moins partiellement modifie. On entend ainsi egalement 
designer les acides nucleiques obtenus par synthese chimique. 

Par « pourcentage d'identite » entre deux sequences d 5 acides nucleiques ou 

15 d'acides amines au sens de la presente invention, on entend designer un 
pourcentage de nucleotides ou de residus d 'acides amines identiques entre les deux 
sequences a comparer, obtenu apres le meilleur aiignement, ce pourcentage etant 
purement statistique et les differences entre les deux sequences etant reparties au 
hasard et sur toute leur longueur. On entend designer par "meilleur aiignement" ou 

20 "aiignement optimal", l'alignement pour lequel le pourcentage d'identite determine 
comme ci-apres est le plus eleve. Les comparaisons de sequences entre deux 
sequences d' acides nucleiques ou d' acides amines sont traditionnellement realisees 
en comparant ces sequences apres les avoir alignees de maniere optimale, ladite 
comparaison etant realisee par segment ou par « fenetre de comparaison » pour 

25 identifier et comparer les regions locales de similarite de sequence. L'alignement 
optimal des sequences pour la comparaison peut etre realise, outre manuellement, 
au moyen de Palgorithme d'homologie locale de Smith et Waterman (1981, Ad. 
App. Math. 2 : 482), au moyen de Talgorithme d'homologie locale deNeddleman et 
Wunsch (1970, J. Mol. Biol. 48 : 443), au moyen de la methode de recherche de 

30 similarite de Pearson et Lipman (1988, Proc. Natl. Acad. Sci. USA 85 : 2444), au 
moyen de logiciels informatiques utilisant ces algorithmes (GAP, BESTFIT, 
BLAST P, BLAST N, FASTA et TFASTA dans le Wisconsin Genetics Software 
Package, Genetics Computer Group, 575 Science Dr., Madison, WI). Afin d'obtenir 
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1' alignement optimal, on utilise de preference le programme BLAST, avec la 
matrice BLOSUM 62. On peut egalement utiliser les matrices PAM ou PAM250. 

Le pourcentage d'identite entre deux sequences d'acides nucleiques ou 
d'acides amines est determine en comparant ces deux sequences alignees de 
5 maniere optimale dans laquelle la sequence d'acides nucleiques ou d'acides amines 
a comparer peut comprendre des additions ou des deletions par rapport a la 
sequence de reference pour un alignement optimal entre ces deux sequences. Le 
pourcentage d'identite est calcule en determinant ie nombre de positions identiques 
pour lesqueiles le nucleotide ou le residu d'acide amine est identique entre les deux 

10 sequences, en divisant ce nombre de positions identiques par le nombre total de 
positions comparees et en multipliant le resultat obtenu par 100 pour obtenir le 
pourcentage d'identite entre ces deux sequences. 

Par sequences nucleiques presentant un pourcentage d'identite d'au moins 
80 %, de preference 85 % ou 90 %, de fa?on plus preferee 95 % voire 98 %, apres 

15 alignement optimal avec une sequence de reference, on entend designer les 
sequences nucleiques presentant, par rapport a la sequence nucleique de reference, 
certaines modifications comme en particulier une deletion, une troncation, un 
allongement, une fusion chimerique et/ou une substitution, notamment ponctuelle, 
et dont la sequence nucleique presente au moins 80 %, de preference 85 %, 90 %, 

20 95 % ou 98 %, d'identite apres alignement optimal avec la sequence nucleique de 
reference. II s'agit de preference de sequences dont les sequences complementaires 
sont susceptibles de s'hybrider specifiquement avec les sequences de reference. De 
preference, les conditions d'hybridation specifiques ou de forte stringence seront 
telles qu'elles assurent au moins 80 %, de preference 85 %, 90 %, 95 % ou 98 % 

25 d'identite apres alignement optimal entre Tune des deux sequences et la sequence 
complementaire de T autre. 

Une hybridation dans des conditions de forte stringence signifie que les 
conditions de temperature et de force ionique sont choisies de telle maniere qu'elles 
permettent le maintien de 1' hybridation entre deux fragments d'ADN 

30 complementaires. A titre illustratif, des conditions de forte stringence de l'etape 
d'hybridation aux fins de defmir les fragments polynucleotidiques decrits ci-dessus, 
sont avantageusement les suivantes. 
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L 5 hybridation ADN-ADN ou ADN-ARN est realisee en deux etapes : (1) 
prehybridation a 42°C pendant 3 heures en tampon phosphate (20 mM, pH 7,5) 
contenant 5 x SSC (1 x SSC correspond a une solution 0,15 M NaCl + 0,015 M 
citrate de sodium), 50 % de formamide, 7 % de sodium dodecyl sulfate (SDS), 10 x 
5 Denhardt's, 5 % de dextran sulfate et 1 % d'ADN de sperme de saumon ; (2) 
hybridation proprement dite pendant 20 heures a une temperature dependant de la 
taille de la sonde (i.e. : 42°C, pour une sonde de taille > 100 nucleotides) suivie de 2 
lavages de 20 minutes a 20°C en 2 x SSC + 2 % SDS, 1 lavage de 20 minutes a 
20°C en 0,1 x SSC + 0,1 % SDS. Le dernier lavage est pratique en 0,1 x SSC + 

10 0,1 % SDS pendant 30 minutes a 60°C pour une sonde de taille > 100 nucleotides. 
Les conditions d' hybridation de forte stringence decrites ci~dessus pour un 
polynucleotide de taille definie, peuvent etre adaptees par Fhomme du metier pour 
des oligonucleotides de taille plus grande ou plus petite, selon Tenseignement de 
Sambrook et aL, (1989, Molecular cloning : a laboratory manual 2 nd Ed. Cold 

1 5 Spring Harbor). 

De plus, par fragment representatif de sequences selon V invention, on 
entend designer tout fragment nucleotidique presentant au moins 15 nucleotides, de 
preference au moins 20, 25, 30, 50, 75, 100, 150, 300 et 450 nucleotides consecutifs 
de la sequence dont il est issu. 

20 Par fragment representatif, on entend en particulier une sequence nucleique 

codant pour un fragment biologiquement actif d'un polypeptide, tel que defini plus 
loin. 

Par fragment representatif, on entend egalement les sequences intergeniques, 
et en particulier les sequences nucleotidiques portant les signaux de regulation 
25 (promoteurs, terminateurs, voire enhancers. . . ). 

Parmi lesdits fragments representatifs, on prefere ceux ayant des sequences 
nucleotidiques correspondant a des cadres ouverts de lecture, denommes sequences 
ORFs (ORF pour « Open Reading Frame »), compris en general entre un codon 
d 5 initiation et un codon stop, ou entre deux codons stop, et codant pour des 
30 polypeptides, de preference d'au moins 100 acides amines, tel que par exemple, 
sans s'y limiter, les sequences ORFs qui seront decrites par la suite. 
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La numerotation des sequences nucleotidiques ORFs qui sera utilisee par la 
suite dans la presente description correspond a la numerotation des sequences 
d'acides amines des proteines codees par lesdites ORFs. 

Les fragments representatifs selon V invention peuvent etre obtenus par 
5 exemple par amplification specifique telle que la PGR ou apres digestion par des 
enzymes de restriction appropries de sequences nucleotidiques selon T invention, 
cette methode etant decrite en particulier dans Fouvrage de Sambrook et aL Lesdits 
fragments representatifs peuvent egalement etre obtenus par synthese chimique lors 
que leur taille n'est pas trap importante, selon des methodes bien connues de 
1 0 rhomme du metier. 

Parmi les sequences contenant des sequences de V invention, ou des 
fragments representatifs, on entend egalement les sequences qui sont naturellement 
encadrees par des sequences qui presentent au moins 80 %, 85 %, 90 %, 95 % ou 98 
% d'identite avec les sequences selon Finvention. 
15 Par sequence nucleotidique modifiee, on entend toute sequence 

nucleotidique obtenue par mutagenese selon des techniques bien connues de 
Phomme du metier, et comportant des modifications, de preference au maximum 10 
% de nucleotides modifies, par rapport aux sequences normales, par exemple des 
mutations dans les sequences regulatrices et/ou promotrices de T expression du 
20 polypeptide, notamment conduisant a une modification du taux d' expression ou de 
Factivite dudit polypeptide. 

Par sequence nucleotidique modifiee, on entend egalement toute sequence 
nucleotidique codant pour un polypeptide modifie tel que definit ci-apres. 

Les fragments representatifs selon Finvention peuvent egalement etre des 
25 sondes ou amorces, qui peuvent etre utilisees dans des procedes de detection, 
d' identification, de dosage ou d 5 amplification de sequences nucleiques, 

Une sonde ou amorce se definit, au sens de Finvention, comme etant un 
fragment d'acides nucleiques simple brin ou un fragment double brin denature 
comprenant par exemple de 12 bases a quelques kb, notamment de 15 a quelques 
30 centaines de bases, de preference de 15 a 50 ou 100 bases, et possedant une 
specificite d' hybridation dans des conditions determinees pour former un complexe 
d'hybridation avec un acide nucleique cible. 
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Les sondes et amorces selon T invention peuvent etre marquees directement 
ou indirectement par un compose radioactif ou non radioactif par des methodes bien 
connues de l'homme du metier, afin d'obtenir un signal detectable et/ou 
quantifiable. 

5 Les sequences de polynucleotides selon 1' invention non marquees peuvent 

etre utilisees directement comme sonde ou amorce. 

Les sequences sont generalement marquees pour obtenir des sequences 
utilisables pour de nombreuses applications. Le marquage des amorces ou des 
sondes selon Finvention est realise par des elements radioactifs ou par des 
1 0 molecules non radioactives. 

Parmi les isotopes radioactifs utilises, on peut citer le 32 P, le 33 P, le 35 S, le 3 H 

1 25 

ou le I Les entites non radioactives sont selectionnees parmi les ligands tels la 
biotine, i'avidine, la streptavidine, la dioxygenine, les haptenes, les colorants, les 
agents luminescents tels que les agents radioluminescents, chemoluminescents, 

1 5 bioluminescents, fluorescents, phosphorescents. 

Les polynucleotides selon Finvention peuvent ainsi etre utilises comme 
amorce et/ou sonde dans des precedes mettant en oeuvre notamment la technique de 
PCR (amplification en chaine par polymerase) (Rolfs et al., 1991, Berlin : Springer- 
Verlag). Cette technique necessite le choix de paires d'amorces oligonucleotidiques 

20 encadrant le fragment qui doit etre amplifie. On peut, par exemple, se referer a la 
technique decrite dans le brevet americain U.S. N° 4,683,202. Les fragments 
amplifies peuvent etre identifies, par exemple apres une electrophorese en gel 
d' agarose ou de poly aery lamide, ou apres une technique chromatographique comme 
la filtration sur gel ou la chromatographic echangeuse d'ions, puis sequences. La 

25 specificite de F amplification peut etre controlee en utilisant comme amorce les 
sequences nucleotidiques de polynucleotides de Finvention comme matrice, des 
plasmides contenant ces sequences ou encore les produits d' amplification derives. 
Les fragments nucleotidiques amplifies peuvent etre utilises comme reactifs dans 
des reactions d 5 hybridation afin de mettre en evidence la presence, dans un 

30 echantillon biologique, d'un acide nucleique cible de sequence complementaire a 
celle desdits fragments nucleotidiques amplifies. 

L'invention vise egalement les acides nucleiques susceptibles d'etre obtenus 
par amplification a Faide d'amorces selon Finvention. 
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D'autres techniques d' amplification de Tacide nucleique cible peuvent etre 
avantageusement employees comme alternative a la PCR (PCR-like) a Taide de 
couple d 5 amorces de sequences nucleotidiques selon V invention. Par PCR-like on 
entend designer toutes les methodes mettant en oeuvre des reproductions directes ou 
5 indirectes des sequences d'acides nucleiques, ou bien dans lesquelles les systemes 
de marquage ont ete amplifies, ces techniques sont bien entendu connues, en 
general il s'agit de 1' amplification de TADN par une polymerase ; lorsque 
Pechantillon d'origine est un ARN il convient prealablement d'effectuer une 
transcription reverse. II existe actuellement de tres nombreux precedes permettant 

10 cette amplification, comme par exemple la technique SDA (Strand Displacement 
Amplification) ou technique d 'amplification a deplacement de brin (Walker et al., 
1992, Nucleic Acids Res. 20 : 1691), la technique TAS (Transcription-based 
Amplification System) decrite par Kwoh et al. (1989, Proc. 'Natl. Acad. Sci. USA, 
86, 1173), la technique 3SR (Self-Sustained Sequence Replication) decrite par 

15 Guatelli et al. (1990, Proc. Natl. Acad. Sci. USA 87; 1874), la technique NASBA 
(Nucleic Acid Sequence Based Amplification) decrite par Kievitis et al. (1991, J. 
Virol. Methods, 35, 273), la technique TMA (Transcription Mediated 
Amplification), la technique LCR (Ligase Chain Reaction) decrite par Landegren et 
al. (1988, Science 241, 1077), la technique de RCR (Repair Chain Reaction) decrite 

20 par Segev (1992, Kessler C. Springer Verlag, Berlin, New- York, 197-205), la 
technique CPR (Cycling Probe Reaction) decrite par Duck et al. (1990, 
Biotechniques, 9, 142), la technique d'amplification a la Q-beta-replicase decrite 
par Miele et al. (1983, J. MoL Biol., 171, 281). Certaines de ces techniques ont 
depuis ete perfectionnees. 

25 Dans le cas ou le polynucleotide cible a detecter est un ARNm, on utilise 

avantageusement, prealablement a la mise en oeuvre d'une reaction d' amplification 
a Paide des amorces selon Finvention ou a la mise en oeuvre d'un procede de 
detection a l'aide des sondes de V invention, une enzyme de type transcriptase 
inverse afin d'obtenir un ADNc a partir de I'ARNm contenu dans Pechantillon 

30 biologique. L'ADNc obtenu servira alors de cible pour les amorces ou les sondes 
mises en oeuvre dans le procede d'amplification ou de detection selon Tinvention. 

La technique d' hybridation de sondes peut etre realisee de manieres di verses 
(Matthews et al., 1988, Anal. Biochem., 169, 1-25). La methode la plus generale 
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consiste a immobiliser Facide nucleique extrait des cellules de differents tissus ou 
de cellules en culture sur un support (tels que la nitrocellulose, le nylon, le 
polystyrene) et a incuber, dans des conditions bien definies, l'acide nucleique cible 
immobilise avec la sonde. Apres Fhybridation, Fexces de sonde est elimine et les 
5 molecules hybrides formees sont detectees par la methode appropriee (mesure de la 
radioactivite, de la fluorescence ou de Factivite enzymatique liee a la sonde). 

Seion un autre mode de mise en oeuvre des sondes nucleiques selon 
F invention, ces dernieres peuvent etre utilisees comme sondes de capture. Dans ce 
cas, une sonde, dite « sonde de capture », est immobilisee sur un support et sert a 
10 capturer par hybridation specifique Facide nucleique cible obtenu a partir de 
Fechantillon biologique a tester et Facide nucleique cible est ensuite detecte grace a 
une seconde sonde, dite « sonde de detection », marquee par un element facilement 
detectable. 

Parmi les fragments d'acides nucleiques interessants, il faut ainsi citer en 

15 particulier les oligonucleotides anti-sens, c'est-a-dire dont la structure assure, par 
hybridation avec la sequence cible, une inhibition de F expression du produit 
correspondant. II faut egalement citer les oligonucleotides sens qui, par interaction 
avec des proteines impliquees dans la regulation de Fexpression du produit 
correspondant, induiront soit une inhibition, soit une activation de cette expression. 

20 De fa?on preferee, les sondes ou amorces selon F invention sont 

immobilisees sur un support, de maniere covalente ou non covalente. En particulier, 
le support peut etre une puce a ADN ou un filtre a haute densite, egalement objets 
de la presente invention. 

On entend designer par puce a ADN ou filtre haute densite, un support sur 

25 lequel sont fixees des sequences d' ADN, chacune d'entre elles pouvant etre reperee 
par sa localisation geographique. Ces puces ou filtres different principalement par 
leur taille, le materiau du support, et eventuellement le nombre de sequences 
d'ADN qui y sont fixees. 

On peut fixer les sondes ou amorces selon la premiere invention sur des 

30 supports solides, en particulier les puces a ADN, par differents precedes de 
fabrication. En particulier, on peut effectuer une synthese in situ par adressage 
photochimique ou par jet d'encre. D'autres techniques consistent a effectuer une 
synthese ex situ et a fixer les sondes sur le support de la puce a ADN par adressage 
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mecanique, electronique ou par jet d'encre. Ces differents procedes sont bien 
connus de rhomme du metier. 

Une sequence nucleotidique (sonde ou amorce) selon P invention permet 
done la detection et/ou F amplification de sequences nucleiques specifiques. En 
5 particulier, la detection de cesdites sequences est facilitee lorsque la sonde est fixee 
sur une puce a ADN, ou a un filtre haute densite. 

L'utilisation de puces a ADN ou de filtres a haute densite permet en effet de 
determiner V expression de genes dans un organisme presentant une sequence 
genomique proche de L. monocytogenes EGD-e. 
10 La sequence genomique de L. monocytogenes EGD-e, completee par 

F identification de tous les genes de cet organisme, telle que presentee dans la 
presente invention, sert de base a la construction de ces puces a ADN ou filtre. 

La preparation de ces filtres ou puces consiste a synthetiser des 
oligonucleotides, correspondant aux extremites 5' et 3' des genes. Ces 
15 oligonucleotides sont choisis en utilisant la sequence genomique et ses annotations 
divulguees par la presente invention. La temperature d'appariement des ces 
oligonucleotides aux places correspondantes sur l'ADN doit etre 
approximativement la meme pour chaque oligonucleotide. Ceci permet de preparer 
des fragments d'ADN correspondants a chaque gene par V utilisation de condition 
20 de PGR appropriees dans un environnement hautement automatisee. Les fragments 
amplifies sont ensuite immobilises sur des filtres ou des supports en verre, silicium 
ou polymeres synthetiques et ces milieux sont utilises pour i* hybridation. 

La disponibilite de tels filtres et/ou puces et de la sequence genomique 
correspondante annotee permet d'etudier l'expression de grands ensembles, voire de 
25 la totalite des genes dans les micro-organismes associes a Listeria monocytogenes, 
en preparant les ADN complementaires, et en les hybridant a l'ADN ou aux 
oligonucleotides immobilises sur les filtres ou les puces. Egalement, les filtres et/ou 
les puces permettent d'etudier la variabilite des souches ou des especes, en preparant 
l'ADN de ces organismes et en les hybridant a l'ADN ou aux oligonucleotides 
30 immobilises sur les filtres ou les puces. 

Les differences entre les sequences genomiques des differentes souches ou 
especes peuvent grandement affecter 1'intensite de l'hybridation et, par consequent, 
perturber l'interpretation des resultats. U peut done etre necessaire d'avoir la 



WO 01/77335 



PCT/FR01/01118 



13 

sequence precise des genes de la souche que Ton souhaite etudier. La methode de 
detection des genes decrite plus loin en detail, impliquant la determination de la 
sequence de fragments aleatoires d'un genome, et les organisant d'apres la sequence 
du genome complet de Listeria monocytogenes EGD-e divulgue dans la presente 
5 invention, peut etre tres utile. 

Les sequences nucleotidiques selon Pinvention peuvent etre utilisees dans 
des puces a ADN pour effectuer F analyse de mutations. Cette analyse repose sur la 
constitution de puces capables d'analyser chaque base d'une sequence nucleotidique 
selon Tinvention. On pourra notamment a cette fin mettre en ceuvre les techniques 

10 de micro-sequengage sur puce a ADN. Les mutations sont detectees par extension 
d'amorces immobilisees hybridant a la matrice des sequences analysees, juste en 
position adjacente de celle du nucleotide mute recherche. Une matrice simpie-brin, 
ARN ou ADN, des sequences a analyser sera avantageusement preparee selon des 
methodes classiques, a partir de produits amplifies selon les techniques de type 

15 PCR. Les matrices d'ADN simple brin, ou d'ARN ainsi obtenues sont alors 
deposees sur la puce a ADN, dans des conditions permettant leur hybridation 
specifique aux amorces immobilisees. Une polymerase thermostable, par exemple 
la Tth ou la Taq ADN polymerase, etend specifiquement l'extremite 3 f de Tamorce 
immobilisee avec un analogue de nucleotide marque complementaire du nucleotide 

20 en position du site variable; par exemple un cycle thermique est realise en presence 
des dideoxyribonucleotides fluorescents. Les conditions experimentales seront 
adaptees notamment aux puces employees, aux amorces immobilisees, aux 
polymerases employees, et au systeme de marquage choisi. Un avantage du 
microsequengage, par rapport aux techniques basees sur Thybridation de sondes, est 

25 qu'il permet d'identifier tous les nucleotides variables avec une discrimination 
optimale dans des conditions de reactions homogenes; utilise sur des puces a ADN, 
il permet une resolution et une specificite optimales pour la detection routiniere et 
industrielle de mutations en multiplex. 

L 5 utilisation des filtres a haute densite et/ou des puces permet ainsi d'obtenir 

30 des connaissances nouvelles sur la regulation des genes dans les organismes 
d'importance industrielle, et en particulier les listeria propagees dans diverses 
conditions. Elle permet aussi une identification rapide des differences entre les 
genomes des souches utilisees dans de multiples applications iridustrielles. 
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En outre, une puce a ADN ou un filtre peut etre un outil extremement 
interessant pour la determination, la detection et/ou T identification d'un 
microorganisme. Ainsi, on prefere egalement les puces a ADN selon F invention qui 
contiennent en outre au moins une sequence nucleotidique d'un microorganisme 
5 autre de Listeria monocytogenes, immobilisee sur le support de ladite puce. De 
preference, le microorganisme choisi l'est parmi les bacteries du genre Listeria (ci- 
apres designees comme bacteries associees a L. monocytogenes) ou les variants de 
Listeria monocytogenes EGD-e. 

Une puce a ADN ou un filtre selon F invention est un element tres utile de 

10 certains kits ou necessaires pour la detection et/ou 1 5 identification de 
microorganismes, en particulier les bacteries appartenant a Fespece Listeria 
monocytogenes ou les microorganismes associes, egalement objets de Finvention. 

Par ailleurs, les puces a ADN ou les filtres selon F invention, contenant des 
sondes ou amorces specifiques de Listeria monocytogenes, sont des elements tres 

15 avantageux de kits ou necessaires pour la detection et/ou la quantification de 
F expression de genes de Listeria monocytogenes (ou de microorganismes associes). 

En effet, le controle de Texpression des genes est un point critique pour 
optimiser la croissance et le rendement d'une souche, soit en permettant Fexpression 
d'un ou plusieurs genes nouveaux, soit en modifiant Texpression de genes deja 

20 presents dans la cellule. La presente invention fournit Tensemble des sequences 
naturellement actives chez L. monocytogenes permettant Texpression des genes. 
Elle permet ainsi la determination de l'ensemble des sequences exprimees chez L. 
monocytogenes. Elle fournit egalement un outil permettant de reperer les genes dont 
l'expression suit un schema donne. Pour realiser cela, FADN de tout ou partie des 

25 genes de L. monocytogenes peut etre amplifie grace a des amorces selon Tinvention, 
puis fixe a un support comme par exemple le verre ou le nylon ou une puce a ADN, 
afin de construire un outil permettant de suivre le profil d'expression de ces genes. 
Get outil, constitue de ce support contenant les sequences codantes sert de matrice 
d'hybridation a un melange de molecules marquees refletant les ARN messagers 

30 exprimes dans la cellule (en particulier les sondes marquees selon Finvention). En 
repetant cette experience a differents instants et en combinant Tensemble de ces 
donnees par un traitement approprie, on obtient alors les profils d'expression de 
Tensemble de ces genes. La connaissance des sequences qui suivent un schema de 
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regulation donnee peut aussi etre mise a profit pour rechercher de maniere dirigee, 
par exemple par homologie, d'autres sequences suivant globalement, mais de 
maniere legerement differente le meme schema de regulation. En complement, il est 
possible d'isoler chaque sequence de controle presente en amont des segments 
5 servant de sondes et d'en suivre l'activite a l'aide de moyen approprie comme un 
gene rapporteur (luciferase, (3-galactosidase, GFP pour « Green Fluorescent 
Protein »). Ces sequences isolees peuvent ensuite etre modifiees et assemblies par 
ingenierie metabolique avec des sequences d'interet en vue de leur expression 
optimale. 

10 Grace a la sequence genomique presentee dans la presente invention, 

rhomme du metier saura identifier les genes codant pour des proteines regulant la 
transcription des genes de L. monocytogenes. Par ailleurs, le tableau I fournit la liste 
des phases ouvertes de lecture (ORF pour « Open Reading Frame) identifies sur le 
genome de Listeria monocytogenes (SEQ ID N° 1), avec leur position sur ledit 

15 genome, et les fonctions putatives qui peuvent leur etre attributes. Toutefois, 
une telle liste ne doit pas etre consideree comme limitative, une proteine pouvant 
faire avoir plusieurs roles dans la cellule. 

Modifier la structure ou l'integrite de ces genes pourra permettre de modifier 
l'expression des genes cibles controles par des promoteurs cibles de ces regulateurs. 

20 Ainsi, rhomme du metier pourra choisir le ou les regulateurs pertinents pour 
^application recherchee ainsi que leur cible, ce qui permet 1 5 optimisation de 
l'expression de genes d'interet. lAitilisation des outils precedemment decrits tels les 
puces a ADN, permet aussi de reperer Tensemble des genes dont la regulation est 
modifiee par Tinactivation de certains genes. II est ainsi possible de selectionner un 

25 ensemble de sequence de controle repondant, a des nuances pres, a un meme type 
de regulation. Ces sequences peuvent etre alors utilisees pour controler l'expression 
de genes d'interet. 

U invention concerne egalement les polypeptides codes par une sequence 
nucleotidique selon Finvention, de preference, par un fragment representatif de la 
30 sequence SEQ ID N° 1 et correspondant a une sequence ORF, telle que decrite dans 
le tableau L En particulier, les polypeptides de Listeria monocytogenes caracterises 
en ce qu'ils sont choisis parmi les polypeptides de sequence SEQ ID N° 2 a SEQ ID 
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N° 2854, de preference de sequence SEQ ID N° 2 a SEQ ID N° 41 et SEQ ID N° 42 
a SEQ ED N° 64 sont objet de 1' invention. 

L'invention comprend egalement les polypeptides caracterises en ce qu'ils 
comprennent un polypeptide choisi parmi : 
5 a) un polypeptide selon Pinvention ; 

b) un polypeptide presentant au moins 80 % de preference 85 %, 90 %, 95 
% et 98 % d'identite avec un polypeptide selon Finvention ; 

c) un fragment d'au moins 5 acides amines d'un polypeptide selon 
P invention, ou tel que defini en b) ; 

10 d) un fragment biologiquement actif d'un polypeptide selon Pinvention, ou 

tel que defini en b) ou c) ; et 
e) un polypeptide modifie d'un polypeptide selon Pinvention, ou tel que 

defini en b), c) ou d). 
Les sequences nucleotidiques codant pour les polypeptides decrits 
1 5 precedemment sont egalement objet de Pinvention. 

Dans la presente description, les termes polypeptides, sequences 
polypeptidiques, peptides et proteines sont interchangeables. 

II doit etre compris que Pinvention ne concerne pas les polypeptides sous 
forme naturelle, c'est-a-dire qu'ils ne sont pas pris dans leur environnement naturel 
20 mais qu'ils ont pu etre isoles ou obtenus par purification a partir de sources 
naturelles, ou bien obtenus par recombinaison genetique, ou par synthese chimique, 
et qu'ils peuvent alors comporter des acides amines non naturels comme cela sera 
decrit plus loin. 

Par polypeptide presentant un certain pourcentage d'identite avec un autre, 
25 que Pon designera egalement par polypeptide homologue, on entend designer les 
polypeptides presentant par rapport aux polypeptides naturels, certaines 
modifications, en particulier une deletion, addition ou substitution d'au moins un 
acide amine, une troncation, un allongement, une solution chimerique et/ou une 
mutation, ou les polypeptides presentant des modifications post-traductionnelles. 
30 Parmi les polypeptides homologues, on prefere ceux dont la sequence d' acides 
amines presentent au moins 80 %, de preference 85 %, 90 %, 95 % et 98 % 
d'homologie avec les sequences d'acides amines des polypeptides selon Pinvention. 
Dans le cas d'une substitution, un ou plusieurs acide(s) amine(s) consecutif(s) ou 
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non consecutif(s) sont remplaces par des acides amines « equivalents ». 
L' expression « acides amines equivalents » vise ici a designer tout acide amine 
susceptible d'etre substitue a Tun des acides amines de la structure de base sans 
cependant modifier essentiellement les activites biologiques des peptides 

r 

5 correspondant et telles qu'elles seront defmies par la suite. 

Ces acides amines equivalents peuvent etre determines soit en s'appuyant 
sur leur homologie de structure avec les acides amines auxquels ils se substituent, 
soit sur des resultats d'essais comparatifs d'activite biologique entre les differents 
polypeptides susceptibles d'etre effectues. 

10 A titre d'exemple, on mentionne les possibilites de substitution susceptibles 

d'etre effectuees sans qu'il resulte en une modification approfondie de 1'activite 
biologique du polypeptide modifie correspondant. On peut remplacer ainsi la 
leucine par la valine ou l'isoleucine, l'acide aspartique par Tacide glutamine, la 
glutamine par l'asparagine, l'arginine par la lysine, etc... les substitutions inverses 

1 5 etant naturellement envisageables dans les memes conditions. 

Les polypeptides homologues correspondent egalement aux polypeptides 
codes par les sequences nucleotidiques homologues ou identiques, telles que 
definies precedemment et comprennent ainsi dans la presente definition des 
polypeptides mutes ou correspondant a des variations inter ou intra especes, 

20 pouvant exister chez Listeria, et qui correspondent notamment a des troncatures, 
substitutions, deletions et/ou additions, d'au moins un residu d' acides amines. 

II est entendu que Ton calcule le pourcentage d'identite entre deux 
polypeptides de la meme fagon qu'entre deux sequences d'acides nucleiques. Ainsi, 
le pourcentage d'identite entre deux polypeptides est calcule apres alignement 

25 optimal de ces deux sequences, sur une fenetre d'homologie maximale. Pour definir 
ladite fenetre d'homologie maximale, on peut utiliser les memes algorithmes que 
pour les sequences d' acide nucleique. 

Par fragment biologiquement actif d'un polypeptide selon 1' invention, on 
entend designer en particulier un fragment de polypeptide, tel que defini ci-apres, 

30 presentant au moins une des caracteristiques biologiques des polypeptides selon 
l'invention, notamment en ce qu'il est capable d'exercer de maniere generate une 
activite meme partielle, tel que par exemple : 
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- une activite enzymatique (metabolique) ou une activite pouvant etre 
impliquee dans la biosynthese ou la biodegradation de composes 
organiques ou inorganiques ; 

- une activite structurelle (enveloppe cellulaire, molecule chaperonne, 
5 ribosome) ; 

- une activite de transport (d'energie, d'ion) ; ou dans la secretion de 
proteine ; 

- une activite dans le processus de replication, amplification, preparation, 
transcription, traduction ou maturation, notamment de PADN, de VARN 

10 ou des proteines. 

Par fragment de polypeptides selon P invention, on entend designer un 
polypeptide comportant au minimum 5 acides amines, de preference 10, 15, 25, 50, 
100 et 150 acides amines. 

Les fragments de polypeptides peuvent correspondre a des fragments isoles 

15 ou purifies naturellement presents dans les souches de Listeria, ou a des fragments 
qui peuvent etre obtenus par clivage dudit polypeptide par une enzyme proteolitique 
telle que la trypsine ou la chymotrypsine ou la collagenase, par un reactif chimique 
(bromure de cyanogene, CNBr) ou en plagant ledit polypeptide dans un 
environnement tres acide (par exemple a pH = 2,5). Des fragments polypeptidiques 

20 peuvent egalement etre prepares par synthese chimique, a partir d'hotes transformes 
par un vecteur d'expression selon Pinvention qui contiennent un acide nucleique 
permettant Pexpression dudit fragment, et place sous le controle des elements de 
regulation et/ou d'expression appropries. 

Par « polypeptide modifie » d'un polypeptide selon Pinvention, on entend 

25 designer un polypeptide obtenu par recombinaison genetique ou par synthese 
chimique comme decrit plus loin, qui presente au moins une modification par 
rapport a la sequence normale, de preference au plus 10 % d'acides amines modifies 
par rapport a la sequence normale. Ces modifications peuvent etre notamment 
portees sur des acides amines necessaires pour la specificite ou Pefficacite de 

30 P activite, ou a Porigine de la conformation structural e, de la charge, ou de 
Phydrophobicite du polypeptide selon Pinvention. On peut ainsi creer des 
polypeptides d'activite equivalents augmentee ou diminuee, ou de specificite 
equivalent^ plus etroite ou plus large. Parmi les polypeptides modifies, il faut citer 
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les polypeptides dans lesquels jusqu'a cinq acides amines peuvent etre modifies, 
tronques a Fextremite N ou C-terminale, ou bien deletes, ou ajoutes. 

Comme cela est indique, les modifications d'un polypeptide ont pour 
object if notamment : 

5 -de permettre sa mise en oeuvre dans des procedes de biosynthese ou de 

biodegradation de composes organiques ou inorganiques, 
de permettre sa mise en oeuvre dans des procedes de replication, 
d 5 amplification, de reparation et regie de transcription, de traduction, ou 
de maturation notamment de F ADN, F ARN, ou de proteines, 
10 -de permettre sa secretion amelioree, 

de modifier sa solubilite, Fefficacite ou la specificite de son activite, ou 
encore de faciliter sa purification. 
La synthese chimique presente egalement Favantage de pouvoir utiliser des 
acides amines non naturels ou des liaisons non peptidiques. Ainsi, il peut etre 
1 5 interessant d'utiliser des acides amines non naturels, par exemple sous forme D, ou 
des analogues d' acides amines, notamment des formes souffrees. 

La presente invention fournit la sequence nucleotidique du genome de 
Listeria monocytogenes EGD-e, ainsi que certaines sequences polypeptidiques. 
L'homme du metier pourra determiner les autres ORFs, en utilisant des methodes 
20 connues, et des logiciels appropries. 

Parmi les genes identifies dans la sequence genomique de L. 
monocytogenes, on peut citer en particulier les genes impliques dans la biosynthese 
de la vitamine B12 (SEQ ID N° 42 a SEQ ID N° 64). Cette bacterie est ainsi 
capable de synthetiser naturellement cette vitamine, et la connaissance des genes 
25 menant a leur synthese permet a Thomme du metier d'optimiser Texpression de ces 
genes ou de les modifier en vue d'augmenter la production de cette vitamine. Ainsi, 
la presente invention a egalement pour objet un procede de production de la 
vitamine B12, caracterise en ce que Fon fournit le substrat de depart a une cellule 
hote contenant les genes correspondant a SEQ ID N° 42 a SEQ ID N° 64, que Fon 
30 la cultive dans les conditions appropriees pour la production de vitamine B12, et 
que Fon recupere ladite vitamine. De preference, la cellule hote est une cellule 
bacterienne, de fa<?on plus preferee, une bacterie du genre Bacillus ou Listeria. Un 
procede de production de la vitamine B!2 utilisant une sequence nucleique, ou 
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polypeptidique selon F invention, ou une cellule hote selon T invention ou un animal 
ou un vegetal selon T invention est egalement un objet de la presente invention. 

De maniere generate, la liste des sequences SEQ ID, ou leur sequence 
I nucleique codante correspondante^ pourra etre determinee par Fhomme de Tart a 
5 parti des fonctions putatives les plus probables determinees pour chacune des 
sequences SEQ ED dans le tableau 1 ci-apres pour chacune des classes d'activite 
repertorie ci-apres. 

Ainsi et de maniere preferee, T invention est relative a une sequence 

nucleotidique selon Tinvention caracterisees en ce qu'elle code pour un polypeptide 
10 de Listeria monocytogenes ou un de ses fragments implique dans la biosynthese de 

la vitamine B12. De preference, il s'agit d'un polypeptide de sequence SEQ ID N° 

42 a SEQ ID N° 64. 

De maniere preferee, 1' invention est relative a une sequence nucleotidique 

selon Tinvention caracterisees en ce qu'elle code pour un polypeptide d'enveloppe 
1 5 cellulaire ou present a la surface de Listeria monocytogenes ou un de ses fragments. 

De preference, il s'agit d'un polypeptide de sequence SEQ ID N° 2 a SEQ ID N° 

41. 

De maniere preferee, Tinvention est relative a une sequence nucleotidique 
selon Tinvention, caracterisees en ce qu'elie code pour un polypeptide de Listeria 
20 monocytogenes ou un de ses fragments implique dans la biosynthese des acides 
amines. 

De maniere preferee, Tinvention est relative a une sequence nucleotidique 
selon Tinvention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
monocytogenes ou un de ses fragments implique dans la biosynthese des cofacteurs, 
25 groupes prosthetiques et transporters. 

De maniere preferee, Tinvention est relative a une sequence nucleotidique 
selon Tinvention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
monocytogenes ou un de ses fragments implique dans la machinerie cellulaire. 

De maniere preferee, Tinvention est relative a une sequence nucleotidique 
30 selon Tinvention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
monocytogenes ou un de ses fragments implique dans le metabolisme intermediate 
central. 
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De maniere preferee, Finvention est relative a une sequence nucleotidique 
selon 1' invention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
monocytogenes ou un de ses fragments implique dans le metabolisme energetique. 

De maniere preferee, T invention est relative a une sequence nucleotidique 
5 selon Finvention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
monocytogenes ou un de ses fragments implique dans le metabolisme des acides 
gras et des phospholipides. 

De maniere preferee, V invention est relative a une sequence nucleotidique 
selon Finvention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
10 monocytogenes ou un de ses fragments implique dans le metabolisme des 
nucleotides, des purines, des pyrimidines ou nucleosides. 

De maniere preferee, Finvention est relative a une sequence nucleotidique 
selon Finvention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
monocytogenes ou un de ses fragments implique dans les fonctions de regulation. 
15 De maniere preferee, Finvention est relative a une sequence nucleotidique 

selon Finvention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
monocytogenes ou un de ses fragments implique dans le processus de replication. 

De maniere preferee, Finvention est relative a une sequence nucleotidique 
selon Finvention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
20 monocytogenes ou un de ses fragments implique dans le processus de transcription. 

De maniere preferee, Finvention est relative a une sequence nucleotidique 
selon Finvention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
monocytogenes ou un de ses fragments implique dans le processus de traduction. 

De maniere preferee, Finvention est relative a une sequence nucleotidique 
25 selon Finvention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
monocytogenes ou un de ses fragments implique dans le processus de transport et de 
liaison des proteines. 

De maniere preferee, Finvention est relative a une sequence nucleotidique 
selon Finvention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
30 monocytogenes ou un de ses fragments implique dans Fadaptation aux conditions 
atypiques. 

De maniere preferee, Finvention est relative a une sequence nucleotidique 
selon {'invention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
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monocytogenes ou un de ses fragments dans la sensibilite aux medicaments et 
analogues. 

De maniere preferee, 1' invention est relative a une sequence nucleotidique 
selon l'invention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
5 monocytogenes ou un de ses fragments implique dans les fonctions relatives aux 
transposons. 

De maniere preferee, l'invention est relative a une sequence nucleotidique 
selon T invention caracterisees en ce qu'elle code pour un polypeptide specifique de 
Listeria monocytogenes ou un de ses fragments. 

Sous un autre aspect, de maniere preferee, l'invention a pour objet un 
polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide de 
Listeria monocytogenes ou un de ses fragments implique dans la biosynthese de la 
vitamine B12. De preference, il s'agit d'un polypeptide de sequence SEQ ID N° 42 
a SEQ ID N° 64. 

Sous un autre aspect, de maniere preferee, l'invention a pour objet un 
polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide 
d'enveloppe cellulaire ou de surface de Listeria monocytogenes ou un de ses 
fragments. De preference, il s'agit d'un polypeptide de sequence SEQ ID N° 2 a 
SEQ ID N° 41. 

Sous un autre aspect, de maniere preferee, {'invention a pour objet un 
polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide de 
Listeria monocytogenes ou un de ses fragments implique dans la biosynthese des 
acides amines. 

Sous un autre aspect, de maniere preferee, l'invention a pour objet un 
polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide de 
Listeria monocytogenes ou un de ses fragments implique dans la biosynthese des 
cofacteurs, groupes prosthetiques et transporters. 

Sous un autre aspect, de maniere preferee, l'invention a pour objet un 
polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide de 
Listeria monocytogenes ou un de ses fragments implique dans la machinerie 
cellulaire. 

Sous un autre aspect, de maniere preferee, l'invention a pour objet un 
polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide de 
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Listeria monocytogenes ou un de ses fragments implique dans le metabolisme 
intermediate central. 

Sous un autre aspect, de maniere preferee, 1' invention a pour objet un 
polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide de 
5 Listeria monocytogenes ou un de ses fragments implique dans le metabolisme 
energetique. 

Sous un autre aspect, de maniere preferee, {'invention a pour objet un 
polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide de 
Listeria monocytogenes ou un de ses fragments implique dans le metabolisme des 

1 0 acides gras et des phospholipides. 

Sous un autre aspect, de maniere preferee, l'invention a pour objet un 
polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide de 
Listeria monocytogenes ou un de ses fragments implique dans le metabolisme des 
nucleotides, des purines, des pyrimidines ou nucleosides. 

15 Sous un autre aspect, de maniere preferee, l'invention a pour objet un 

polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide de 
Listeria monocytogenes ou un de ses fragments implique dans les fonctions de 
regulation. 

Sous un autre aspect, de maniere preferee, l'invention a pour objet un 
20 polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide de 
Listeria monocytogenes ou un de ses fragments implique dans le processus de 
replication. 

Sous un autre aspect, de maniere preferee, l'invention a pour objet un 
polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide de 
25 Listeria monocytogenes ou un de ses fragments implique dans le processus de 
transcription. 

Sous un autre aspect, de maniere preferee, l'invention a pour objet un 
polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide de 
Listeria monocytogenes ou un de ses fragments implique dans le processus de 
30 traduction. 

Sous un autre aspect, de maniere preferee, l'invention a pour objet un 
polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide de 
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Listeria monocytogenes ou un de ses fragments implique dans le processus de 
transport et de liaison des proteines. 

Sous un autre aspect, de maniere preferee, r invention a pour objet un 
polypeptide selon F invention, caracterise en ce qu'il s'agit d'un polypeptide de 
5 Listeria monocytogenes ou un de ses fragments implique dans F adaptation aux 
conditions atypiques. 

Sous un autre aspect, de maniere preferee, F invention a pour objet un 
polypeptide selon F invention, caracterise en ce qu'il s'agit d'un polypeptide de 
Listeria monocytogenes ou un de ses fragments dans la sensibilite aux medicaments 
10 et analogues. 

Sous un autre aspect, de maniere preferee, Finvention a pour objet un 
polypeptide selon Finvention, caracterise en ce qu'il s'agit d'un polypeptide de 
Listeria monocytogenes ou un de ses fragments implique dans les fonctions 
relatives aux transposons. 

15 Sous un autre aspect, de maniere preferee, Finvention a pour objet un 

polypeptide selon Finvention, caracterise en ce qu'il s'agit d'un polypeptide 
specifique de Listeria monocytogenes ou un de ses fragments. 

II est important de noter toutefois qu'un organisme vivant est un tout et doit 
etre pris comme tel. Ainsi, afin de pouvoir se developper et d'exhiber ses proprietes, 

20 tout organisme a besoin d' interactions entre les differentes voies metaboliques. 
Ainsi, la classification enoncee ci-dessus ne doit pas etre consideree comme 
limitative, un gene pouvant etre implique dans deux voies metaboliques distinctes. 

La presente invention a egalement pour objet les sequences nucleotidiques 
et/ou de polypeptides selon Finvention, caracterisees en ce que lesdites sequences 

25 sont enregistrees sur un support d'enregistrement dont la forme et la nature 
facilitent la lecture, Fanalyse et/ou Fexploitation de ladite ou desdites sequence(s). 
Ces supports peuvent egalement contenir d'autres informations extraites de la 
presente invention, notamrnent les analogies avec des sequences deja connues, et/ou 
des informations concernant les sequences nucleotidiques et/ou de polypeptides 

30 d'autres microorganismes afin de faciliter Fanalyse comparative et Fexploitation 
des resultats obtenus. 

Parmi cesdits supports d'enregistrement, on prefere en particulier les 
supports lisibles par un ordinateur, tels les supports magnetiques, optiques, 



WO 01/77335 



PCT/FR01/01118 



25 

electriques ou hybrides, en particulier les disquettes informatiques, les CD-ROM, 
les serveurs informatiques. De tels supports d'enregistrement sont egalement objet 
de T invention. 

Les supports d'enregistrement selon Finvention, avec les informations 
5 apportees, sont tres utiles pour le choix d' amorces ou de sondes nucleotidiques pour 
la determination de genes dans Listeria monocytogenes ou souches proches de cet 
organisme. De meme, Putilisation de ces supports pour F etude du polymorphisme 
genetique de souche proche de Listeria monocytogenes, en particulier par la 
determination des regions de coiinearite, est tres utile dans la mesure ou ces 

10 supports fournissent non seulement la sequence nucleotidique du genome de 
Listeria monocytogenes egb, mais egalement F organisation genomique dans ladite 
sequence. Ainsi, les utilisations de supports d'enregistrement selon Finvention sont 
egalement des objets de Finvention. 

L' analyse d'homologie entre differentes sequences s'efFectue en effet 

15 avantageusement a Faide de logiciels de comparaisons de sequences, tels le logiciel 
Blast, ou les logiciels de la trousse GCG, decrits precedemment 

L' invention vise egalement les vecteurs de clonage et/ou d' expression, qui 
contientiQnt une sequence nucleotidique selon Finvention. On prefere en particulier, 
les sequences nucleotidiques codant pour des polypeptides d'enveloppe cellulaire 

20 ou de surface, ou impliques dans la machinerie cellulaire, en particulier la secretion, 
le metabolisme intermediate central, en particulier la production de sucre, le 
metabolisme energetique, les processus de synthese de la vitamine B12, de 
transcription et de traduction, de synthese de polypeptides. 

Les vecteurs selon Finvention comportent de preference des elements qui 

25 permettent F expression et/ou la secretion des sequences nucleotidiques dans une 
cellule hote determinee. 

Le vecteur doit alors comporter un promoteur, des signaux d' initiation et de 
terminaison de la traduction, ainsi que des regions appropriees de regulation de la 
transcription. II doit pouvoir etre maintenu de fa$on stable dans la cellule bote et 

30 peut eventuellement posseder des signaux particuliers qui specifient la secretion de 
la proteine traduite. Ces difFerents elements sont choisis et optimises par Fhomme 
du metier en fonction de Fhote cellulaire utilise. A cet effet, les sequences 
nucleotidiques selon Finvention peuvent etre inserees dans des vecteurs a 
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replication autonome au sein de Phote choisi, ou etre des vecteurs integratifs de 
Phote choisi. 

De tels vecteurs sont prepares par des methodes couramment utilisees par 
rhomme du metier, et les clones resultant peuvent etre introduits dans un hote 
5 approprie par des methodes standards, telle que la lipofection, Peiectroporation, le 
choc thermique, ou des methodes chimiques. 

Les vecteurs selon P invention sont par exemple des vecteurs d'origine 
plasmidique ou virale. lis sont utiles pour transformer des cellules hotes afin de 
doner ou d'exprimer les sequences nucleotidiques selon Pinvention. 
10 L'invention comprend egalement les cellules hotes transformees par un 

vecteur selon Pinvention. 

L'hote cellulaire peut etre choisi parmi des systemes procaryotes ou 
eucaryotes, par exemple les cellules bacteriennes mais egalement les cellules de 
levure ou les cellules animates, en particulier les cellules de mammiferes. On peut 
15 egalement utiliser des cellules d'insectes ou des cellules de plantes. Les cellules 
hotes preferees selon Pinvention sont en particulier les cellules procaryotes, de 
preference les bacteries appartenant au genre Listeria, a Pespece Listeria 
monocytogenes, ou les microorganismes associes a Pespece Listeria 
monocytogenes. L'invention concerne egalement les animaux, excepte Phomme, 
20 qui comprennent une cellule transformee selon Pinvention. Les cellules 
transformees selon Pinvention sont utilisables dans des precedes de preparation de 
polypeptides recombinants selon Pinvention. Les procedes de preparation d'un 
polypeptide selon Pinvention sous forme recombinante, caracterises en ce qu'ils 
mettent en oeuvre un vecteur et/ou une cellule transformee par un vecteur selon 
25 Pinvention sont eux-memes compris dans la presente invention. De preference, on 
cultive une cellule transformee par un vecteur selon Pinvention dans des conditions 
qui permettent Pexpression dudit polypeptide et on recupere ledit peptide 
recombinant. Les cellules hotes selon Pinvention peuvent egalement etre utilisees 
pour la preparation de compositions alimentaires, qui sont elles-memes objet de la 
30 presente invention. 

Ainsi qu'il a ete dit, Phote cellulaire peut etre choisi parmi des systemes 
procaryotes ou eucaryotes. En particulier, il est possible d' identifier des sequences 
nucleotidiques selon Pinvention, facilitant la secretion dans un tel systeme 
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procaryote ou eucaryote. Un vecteur selon V invention portant une telle sequence 
peut done etre avantageusement utilise pour la production de proteines 
recombinantes, destinees a etre secretees. En effet, la purification de ces proteines 
recombinantes d'interet sera facilite par le fait qu'elles sont presentent dans le 
5 surnageant de la culture celiulaire plutot qu'a l'interieur des cellules hotes. 

On peut egalement preparer les polypeptides selon F invention par synthese 
chimique. Un tel procede de preparation est egalement un objet de T invention. 
L'homme du metier connatt les precedes de synthese chimique, par exemple les 
techniques mettant en oeuvre des phases solides (voir notamment Steward et al, 

10 1984, Solid phase peptides synthesis, Pierce Chem. Company, Rockford, 111, 2eme 
ed., (1984)) ou des techniques utiiisant des phases solides partielles, par 
condensation de fragments ou par une synthese en solution classique. Les 
polypeptides obtenus par synthese chimique et pouvant comporter des acides 
amines non naturels correspondant sont egalement compris dans V invention. 

15 L'invention est en outre relative a des polypeptides hybrides presentant au 

moins un polypeptide ou un de ses fragments selon T invention, et une sequence 
d'un polypeptide susceptible d'induire une reponse immunitaire chez Fhomme ou 
1' animal. 

Avantageusement, le determinant antigenique est tel qu'il est susceptible 
20 d'induire une reponse humorale et/ou celiulaire. 

Un tel determinant pourra comprendre un polypeptide ou un de ses 
fragments selon l'invention sous forme glycosylee utilise en vue d'obtenir des 
compositions immunogenes susceptibles d'induire la synthese d'anticorps diriges 
contre des epitopes multiples. Lesdits polypeptides ou leurs fragments glycosyles 
25 font egalement partie de l'invention. 

Ces molecules hybrides peuvent etre constitutes en partie d'une molecule 
porteuse de polypeptides ou de leurs fragments selon l'invention, associee a une 
partie eventuellement immunogene, en particulier un epitope de la toxine 
diphterique, la toxine tetanique, un antigene de surface du virus de V hepatite B 
30 (brevet FR 79 21811), 1' antigene VP1 du virus de la poliomyelite ou toute autre 
toxine ou antigene viral ou bacterien. 

Les precedes de synthese des molecules hybrides englobent les methodes 
utilisees en genie genetique pour construire des sequences nucleotidiques hybrides 
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codant pour les sequences polypeptidiques recherchees. On pourra, par exemple, se 
referer avantageusement a la technique d'obtention de genes codant pour des 
proteines de fusion decrite par Minton en 1984. 

Lesdites sequences nucleotidiques hybrides codant pour un polypeptide 
5 hybride ainsi que les polypeptides hybrides selon 1' invention caracterises en ce qu'il 
s'agit de polypeptides recombinants obtenus par l'expression desdites sequences 
nucleotidiques hybrides, font egalement partie de 1' invention. 

L'invention comprend egalement les vecteurs caracterises en ce qu'ils 
contiennent une desdites sequences nucleotidiques hybrides. Les cellules hotes 
10 transformees par lesdits vecteurs, les animaux transgeniques comprenant une 
desdites cellules transformees ainsi que les procedes de preparation de polypeptides 
recombinants utilisant lesdits vecteurs, lesdites cellules transformees et/ou lesdits 
animaux transgeniques font bien entendu egalement partie de l'invention. 

Le couplage entre un polypeptide selon l'invention et un polypeptide 
1 5 immunogene, peut etre effectue par voie chimique, ou par voie biologique. Ainsi, 
selon l'invention, il est possible d'introduire un ou plusieurs element(s) de liaison, 
notamment des acides amines pour faciliter les reactions de couplage entre le 
polypeptide selon l'invention, et le polypeptide immunostimulateur, le couplage 
covalent de l'antigene immunostimulateur pouvant etre realise a l'extremite N ou 
20 C-terminale du polypeptide selon l'invention. Les reactifs bifonctionnels permettant 
ce couplage sont determines en fonction de l'extremite choisie pour realiser ce 
couplage, et les techniques de couplage sont bien connues de l'homme du metier. 

Les conjugues issus d'un couplage de peptides peuvent etre egalement 
prepares par recombinaison genetique. Le peptide hybride (conjugue) peut en effet 
25 etre produit par des techniques d'ADN recombinant, par insertion ou addition a la 
sequence d'ADN codant pour le polypeptide selon l'invention, d'une sequence 
codant pour le ou les peptide(s) antigene(s), immunogene(s) ou haptene(s). Ces 
techniques de preparation de peptides hybrides par recombinaison genetique sont 
bien connues de l'homme du metier (voir par exemple Makrides, 1996, 
30 Microbiological Reviews 60, 5 12-53 8). 

De preference, ledit polypeptide immunitaire est choisi dans le groupe des 
peptides contenant les anatoxines, notamment le toxolde diphterique ou le toxoide 
tetanique, les proteines derivees du Streptocoque (comme la proteine de liaison a la 
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seralbumine humaine), les proteines membranaires OMPA et les complexes de 
proteines de membranes externes, les vesicules de membranes externes ou les 
proteines de chocs thermiques. 

Les polypeptides hybrides selon 1' invention sont tres utiles pour obtenir des 
5 anticorps monoclonaux ou polyclonaux, capables de reconnaitre specifiquement les 
polypeptides selon Tinvention. En effet, un polypeptide hybride selon Tinvention 
permet la potentiation de la reponse immunitaire, contre le polypeptide selon 
Tinvention couple a la molecule immunogene. De tels anticorps monoclonaux ou 
polyclonaux, leurs fragments, ou les anticorps chimeriques, reconnaissant les 
1 0 polypeptides selon Tinvention, sont egalement objets de Tinvention. 

Les anticorps monoclonaux specifiques peuvent etre obtenus selon la 
methode classique de culture d'hybridome decrite par Kohler et Mlstein (1975, 
Nature 256, 495). 

Les anticorps selon Tinvention sont par exemple des anticorps chimeriques, 
15 des anticorps humanises, des fragments Fab, ou F(ab') 2 . II peut egalement se 
presenter sous forme d'irnmunoconjugue ou d'anticorps marque afin d'obtenir un 
signal detectable et/ou quantifiable. 

Ainsi, les anticorps selon Tinvention peuvent etre employes dans un procede 
pour la detection et/ou T identification de bacteries appartenant a Tespece Listeria 
20 monocytogenes ou a un microorganisme associe dans un echantillon biologique, 
caracterise en ce qu'il comprend les etapes suivantes ; 

a) mise en contact de T echantillon biologique avec un anticorps selon 
Tinvention ; 

b) mise en evidence du complexe antigene-anticorps eventuellement forme. 
25 Les anticorps selon la presente invention sont egalement utilisables afin de 

detecter une expression d'un gene de Listeria monocytogenes ou de 
microorganismes associes. En effet, la presence du produit d'expression d'un gene 
reconnu par un anticorps specifique dudit produit expression peut etre detectee par 
la presence d'un complexe antigene-anticorps forme apres la mise en contact de la 
30 souche de Listeria monocytogenes ou du microorganisme associe avec un anticorps 
selon Tinvention. La souche bacterienne utilisee peut avoir ete « preparee », c'est-a- 
dire centrifugee, lysee, placee dans un reactif approprie pour la constitution du 
milieu propice a la reaction immunologique. En particulier, on prefere un procede 
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de detection de P expression dans le gene, correspondant a un Western blot, pouvant 
etre effectue apres une electrophorese sur gel de polyacrylamide d'un lysat de la 
souche bacterienne, en presence ou en P absence de conditions reductrices (SDS- 
PAGE). Apres migration et separation des proteines sur le gel de polyacrylamide, 
5 on transfere lesdites proteines sur une membrane appropriee (par exemple en nylon) 
et on detecte la presence de la proteine ou du polypeptide d'interet, par mise en 
contact de ladite membrane avec un anticorps selon P invention. 

Ainsi, la presente invention comprend egalement les kits ou necessaires pour 
la mise en oeuvre d'un procede tel que decrit (de detection de I'expression d'un gene 
10 de Listeria monocytogenes ou d'un microorganisme associe, ou pour la detection 
et/ou P identification de bacteries appartenant a Pespece Listeria monocytogenes ou 
un microorganisme associe), comprenant les elements suivants : 

a) un anticorps polyclonal ou monoclonal selon P invention ; 

b) eventuellement, les reactifs pour la constitution du milieu propice a la 
1 5 reaction immunologique ; 

c) eventuellement, les reactifs permettant la mise en evidence des 
complexes antigene-anticorps produits par la reaction immunologique. 

Les polypeptides et les anticorps selon Pinvention peuvent avantageusement 
etre immobilises sur un support, notamment une puce a proteines. Une telle puce a 

20 proteines est un objet de Pinvention, et peut egalement contenir au moins un 
polypeptide d'un microorganisme autre que Listeria monocytogenes ou un anticorps 
dirige contre un compose d'un microorganisme autre que Listeria monocytogenes. 

Les puces a proteines ou filtres a haute densite contenant des proteines selon 
Pinvention peuvent etre construits de la meme maniere que les puces a ADN selon 

25 Pinvention. En pratique, on peut effectuer la synthese des polypeptides fixes 
directement sur la puce a proteines, ou effectuer une synthese ex situ suivie d'une 
etape de fixation du polypeptide synthetise sur ladite puce. Cette derniere methode 
est preferable, lorsque Pen desire fixer des proteines de taille importante sur le 
support, qui sont avantageusement preparees par genie genetique, Toutefois, si Pon 

30 ne desire fixer que des peptides sur le support de ladite puce, il peut etre plus 
interessant de proceder a la synthese desdits peptides directement in situ. 

Les puces a proteines selon Pinvention peuvent etre avantageusement 
utilisees dans des kits ou necessaires pour la detection et/ou P identification de 
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bacteries associees a Pespece Listeria monocytogenes ou a un microorganisme, ou 
de fa?on plus generate dans des kits ou necessaires pour la detection et/ou 
P identification de microorganismes. Lorsque Pon fixe les polypeptides selon 
Tinvention sur les puces a ADN, on recherche la presence d'anticorps dans les 
5 echantillons testes, la fixation d'un anticorps selon Tinvention sur le support de la 
puce a proteines permettant P identification de la proteine dont ledit anticorps est 
specifique. 

De preference, on fixe un anticorps selon P invention sur le support de la 
puce a proteines, et on detecte la presence de Pantigene correspondant, specifique 
10 de Listeria monocytogenes ou d'un microorganisme associe. 

Une puce a proteines ci-dessus decrite peut etre utilisee pour la detection de 
produits de genes, pour etablir un profil d 5 expression desdits genes, en complement 
d'une puce a ADN selon P invention. 

Les puces a proteines selon P invention sont egalement extremement utiles 
15 pour les experiences de proteomique, qui etudie les interactions entre les differentes 
proteines d'un microorganisme donne. De fa?on simplifiee, on fixe des peptides 
representatifs des differentes proteines d'un organisme sur un support. Puis, on met 
ledit support en contact avec des proteines marquees, et apres une etape optionnelle 
de ringage, on detecte des interactions entre lesdites proteines marquees et les 
20 peptides fixes sur la puce a proteines. 

Ainsi, les puces a proteines comprenant une sequence polypeptidique selon 
Pinvention ou un anticorps selon Pinvention sont objet de Pinvention, ainsi que les 
kits ou necessaires les contenant 

La presente invention couvre egalement un procede de detection et/ou 
25 d' identification de bacteries appartenant a Pespece Listeria monocytogenes ou a un 
microorganisme associe dans un echantillon biologique, qui met en oeuvre une 
sequence nucleotidique selon 1' invention. 

II doit etre entendu que le terme echantillon biologique concerne dans la 
presente invention les echantillons preleves a partir d'un organisme vivant (en 
30 particulier sang, tissus, organes ou autres preleves a partir d'un mammifere) ou un 
echantillon contenant du materiel biologique, c'est-a-dire de PADN. Un tel 
echantillon biologique englobe done les compositions alimentaires contenant des 
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bacteries (par exempie les fromages, les produits laitiers), mais egalement des 
compositions alimentaires contenant des levures (bieres, pains) ou autres. 

Le precede de detection et/ou d' identification mettant en reuvre les 
sequences nucleotidiques selon Pinvention peut etre de diverse nature. 

5 

On prefere un precede comportant les etapes suivantes : 

a) eventuellement, isolement de l'ADN a partir de l'echantillon biologique a 

analyser, ou obtention d'un ADNc a partir de TARN de l'echantillon 

biologique ; 

10 b ) amplification specifique de l'ADN de bacteries appartenant a Pespece 

Listeria monocytogenes ou a un micro-organisme associe a 1'aide d'au 
moins une amorce selon 1' invention ; 
c) mise en evidence des produits d'amplification. 

Ce precede est base sur P amplification specifique de l'ADN, en particulier 
1 5 par une reaction d'amplification en chaine. 

On prefere egalement un precede comprenant les etapes suivantes : 

a) mise en contact d'une sonde nucleotidique selon l'invention avec un 
echantillon biologique, Pacide nucleique contenu dans l'echantillon 
biologique ayant, le cas echeant, prealablement ete rendu accessible a 

20 T hybridation, dans des conditions permettant l' hybridation de la sonde a 

l'acide nucleique d'une bacterie appartenant a l'espece Listeria 
monocytogenes ou a un micro-organisme associe ; 

b) mise en evidence de l'hybride eventuellement forme entre la sonde 
nucleotidique et l'ADN de l'echantillon biologique. 

25 Un tel precede ne doit pas etre limite a la detection de la presence de l'ADN 

contenu dans l'echantillon biologique atteste, il peut etre egalement mis en oeuvre 
pour detecter PARN contenu dans ledit echantillon. Ce precede englobe en 
particulier les Southern et Northern blot. 

Un autre precede prefere selon l'invention comprend les etapes suivantes : 
50 a ) mise en contact d'une sonde nucleotidique immobilisee sur un support 

selon l'invention avec un echantillon biologique, Pacide nucleique de 
P echantillon, ayant, le cas echeant, ete prealablement rendu accessible a 
P hybridation, dans des conditions permettant P hybridation de la sonde a 
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Pacide nucleique (Tune bacterie appartenant a Pespece Listeria 
monocytogenes ou a un micro-organisme associe ; 

b) mise en contact de Phybride forme entre la sonde nucleotidique 
immobilisee sur un support et Pacide nucleique contenu dans 

5 Pechantillon biologique, le cas echeant apres elimination de PADN de 

Pechantillon biologique n'ayant pas hybride avec la sonde, avec une 
sonde nucleotidique marquee selon P invention ; 

c) mise en evidence du nouvel hybride forme a Petape b). 

Ce procede est avantageusement utilise avec une puce a ADN selon 
10 Pinvention, Pacide nucleique recherche s'hybridant avec une sonde presente a la 
surface de ladite puce, et etant detecte par Putilisation d'une sonde marquee. Ce 
procede est avantageusement mis en oeuvre en combinant une etape prealable 
d' amplification de PADN ou de PADN complementaire obtenu eventuellement par 
transcription inverse, a Paide d' amorces selon Pinvention. 
1 5 Ainsi, la presente invention englobe egalement les kits ou necessaires pour 

la detection et/ou P identification de bacteries appartenant a Pespece Listeria 
monocytogenes ou a un micro-organisme associe, caracterise en ce qu'il comprend 
les elements suivants : 

a) une sonde nucleotidique selon Pinvention ; 
20 b) eventuellement, les reactifs necessaires a la mise en oeuvre d'une 

reaction d 5 hybridation ; 
c) eventuellement, au moins une amorce selon Pinvention ainsi que les 

reactifs necessaires a une reaction d 5 amplification de PADN. 
De meme, la presente invention englobe egalement les kits ou necessaires 
25 pour la detection et/ou P identification de bacteries appartenant a Pespece Listeria 
monocytogenes ou a un micro-organisme associe, caracterise en ce qu'il comprend 
les elements suivants : 

a) une sonde nucleotidique, dite sonde de capture, selon Pinvention; 

b) une sonde oligonucleotidique, dite sonde de revelation, selon 
30 Pinvention ; 

c) eventuellement, au moins une amorce selon Pinvention ainsi que les 
reactifs necessaires a une reaction d' amplification de PADN. 
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Enfin, les kits ou necessaires pour la detection et/ou V identification de 
bacteries appartenant a l'espece Listeria monocytogenes ou a un micro-organisme 
associe, caracterise en ce qu'il comprend les elements suivants : 
a) au moins une amorce selon 1' invention ; 
5 b) eventuellement, les reactifs necessaires pour effectuer une reaction 

d'amplification d'ADN ; 
c) eventuellement, un composant permettant de verifier la sequence du 
fragment amplifie, plus particulierement une sonde oligonucleotidique 
selon l'invention. 
1 0 sont egalement objets de la presente invention. 

De preference, lesdites amorces et/ou sondes et/ou polypeptides et/ou 
anticorps selon la presente invention utilises dans les precedes et/ou kits ou 
necessaires selon la presente invention sont choisis parmi les amorces et/ou sondes 
et/ou polypeptides et/ou anticorps specifiques de l'espece Listeria monocytogenes. 
15 De maniere preferee, ces elements sont choisis parmi les sequences nucleotidiques 
condant pour une proteine secretee, parmi les polypeptides secretes, ou parmi les 
anticorps diriges contre des polypeptides secretes de Listeria monocytogenes. 

La presente invention a egalement pour objet les souches de Listeria 
monocytogenes et/ou de microorganismes associes contenant une ou plusieurs 
20 mutation(s) dans une sequence nucleotidique selon l'invention, en particulier une 
sequence ORF, ou leurs elements regulateurs (en particulier promoteurs). 

On prefere, selon la presente invention, les souches de Listeria 
monocytogenes presentant une ou plusieurs mutation(s) dans les sequences 
nucleotidiques codant pour des polypeptides impliques dans la machine cellulaire, 
25 en particulier la secretion, le metabolisme intermediate central, le metabolisme 
energetique, les processus de synthese des acides amines, de transcription et de 
traduction, de synthese des polypeptides. 

Lesdites mutations peuvent mener a une inactivation du gene, ou en 
particulier lorsqu'elles sont situees dans les elements regulateurs dudit gene, a une 
30 surexpression de celui-ci. 

L'invention concerne en outre Tutilisation d'une sequence nucleotidique 
selon l'invention, d'un polypeptide selon l'invention, d'un anticorps selon 
l'invention, d'une cellule selon l'invention, et/ou d'un animal transforme selon 
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F invention, pour la selection de compose organique ou inorganique capable de 
moduler, de reguler, d'induire ou d'inhiber F expression de genes, et/ou de modifier 
la replication cellulaire de cellules eucaryotes ou procaryotes ou capables d'induire, 
d'inhiber ou d'aggraver les pathologies liees a une infection par Listeria 
5 monocytogenes ou un de ses micro-organismes associes. 

L' invention comprend egalement une methode de selection de composes 
capables de se lier a un polypeptide ou un de ses fragments selon V invention, 
capables de se lier a une sequence nucleotidique selon F invention, ou capable de 
reconnaitre un anticorps selon la revendication, et/ou capables de moduler, de 

10 reguler, d'induire ou d'inhiber Fexpression de genes, et/ou de modifier la 
croissance ou la replication cellulaire de cellules eucaryotes ou procaryotes, ou 
capables d'induire, d'inhiber ou d'aggraver chez un organisme animal ou humain 
les pathologies liees a une infection par Listeria monocytogenes, ou un de ses 
micro-organismes associes, caracterisee en ce qu'elle comprend les etapes 

1 5 suivantes : 

a) mise en contact dudit compose avec ledit polypeptide, ladite sequence 
nucleotidique, avec une cellule transformee selon F invention et/ou 
administration dudit compose a un animal transforme selon Finvention; 

b) determination de la capacite dudit compose a se lier avec ledit 
20 polypeptide ou ladite sequence nucleotidique, ou de moduler, de reguler, 

d'induire ou d'inhiber Fexpression de genes, ou de moduler la 
croissance ou la replication cellulaire, ou d'induire, d'inhiber ou 
d'aggraver chez ledit animal transforme les pathologies liees a une 
infection par Listeria monocytogenes ou un de ses micro-organismes 
25 associes. 

Les cellules et/ou les animaux transformes selon Finvention, pourront 
avantageusement servir de modele et etre utilises dans des precedes pour etudier, 
identifier et/ou selectionner des composes susceptibles d'etre responsables de 
pathologies induites ou aggravees par Listeria monocytogenes, ou susceptibles de 
30 prevenir et/ou de traiter ces pathologies comme par exemple les maladies genitales, 
oculaires ou systemiques, notamment du systeme lymphatique. En particulier, les 
cellules hotes transformees, notamment les bacteries de la famille des Listerias dont 
la transformation par un vecteur selon Finvention peut par exemple accroitre ou 
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inhiber son pouvoir infectieux, ou moduler les pathologies habituellement induites 
ou aggravees par Finfection, pourront etre utilisees pour infecter des animaux dont 
on suivra F apparition des pathologies. Ces animaux non transformes, infectes par 
exemple avec des bacteries Listerias transformees, pourront servir de modele 
5 d' etude. De la meme maniere, les animaux transformes selon F invention pourront 
etre utilises dans des procedes de selection de composes susceptibles de prevenir 
et/ou de traiter les maladies dues a Listeria. Lesdits procedes utilisant lesdites 
cellules transformees et/ou animaux transformes, font partie de F invention. 

Les composes susceptibles d'etre selectionnes peuvent etre des composes 
10 organiques tels que des polypeptides ou hydrates de carbone ou tous autres 
composes organiques ou inorganiques deja connus, ou des composes organiques 
nouveaux elabores a partir de techniques de modelisation moleculaire et obtenus par 
synthese chimique ou biochimique, ces techniques etant connues de l'homme de 
Tart. 

15 Lesdits composes selectionnes pourront etre utilises pour moduler la 

croissance et/ou la replication cellulaire de Listeria monocytogenes ou tout autre 
micro-organisme associe et ainsi pour controler T infection par ces micro- 
organismes. Lesdits composes selon F invention pourront egalement etre utilises 
pour moduler la croissance et/ou la replication cellulaire de toutes cellules 

20 eucaryotes ou procaryotes, notamment les cellules tumorales et les micro- 
organismes infectieux, pour lesquelles lesdits composes s'avereront actifs, les 
methodes permettant de determiner lesdites modulations etant bien connues de 
Fhomme de Fart. 

On entend designer par compose capable de moduler la croissance d'un 
25 micro-organisme tout compose permettant de d'intervenir, de modifier, de limit er 
et/ou de reduire le developpement, la croissance, la vitesse de proliferation et/ou la 
viabilite dudit micro-organisme. 

Cette modulation peut etre realisee par exemple par un agent capable de se 
Her a une proteine et ainsi d'inhiber ou de potentialiser son activite biologique, ou 
30 capable de se lier a une proteine membranaire de la surface exterieure d'un micro- 
organisme et de bloquer la penetration dudit micro-organisme dans la cellule hote 
ou de favoriser Taction du systeme immunitaire de Torganisme infecte dirige a 
Fencontre dudit micro-organisme. Cette modulation peut etre egalement realisee par 
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un agent capable de se Her a une sequence nucleotidique d'un ADN ou ARN d'un 
micro-organisme et de bloquer par exemple l'expression d'un polypeptide dont 
l'activite biologique ou structured est necessaire a la croissance ou a la 
reproduction dudit micro-organisme. 
5 On entend designer par micro-organisme associe dans la presente invention, 

tout micro-organisme dont l'expression de gene peut etre modulee, regulee, induite 
ou inhibee, ou dont la croissance ou la replication cellulaire peut etre egalement 
modulee par un compose de ^invention. On entend designer egalement par micro- 
organisme associe dans la presente invention, tout micro-organisme comportant des 

10 sequences nucleotidiques ou des polypeptides selon l'invention. Ces micro- 
organismes peuvent dans certains cas comporter des polypeptides ou des sequences 
nucleotidiques identiques ou homologues a celles de Invention pourront egalement 
etre detectes et/ou identifies par les precedes ou kit de detection et/ou 
d' identification selon l'invention et egalement servir de cible pour les composes de 

1 5 l'invention. 

L'invention concerne les composes susceptibles d'etre selectionnes par une 
methode de selection selon Pinvention. 

L'invention concerne egalement une composition pharmaceutique 
comprenant un compose choisi parmi les composes suivants : 
20 a) une sequence nucleotidique selon l'invention ; 

b) un polypeptide selon l'invention ; 

c) un vecteur selon l'invention ; 

d) un anticorps selon l'invention ; et 

e) un compose susceptible d'etre selectionne par une methode de selection 
- 5 selon l'invention, eventuellement en association avec un vehicule 

pharmaceutiquement acceptable. 
On entend designer par quantite efficace, une quantite suffisante dudit 
compose ou anticorps, ou de polypeptide de l'invention, permettant de moduler la 
croissance de Listeria monocytogenes ou d'un micro-organisme associe. 
30 L'invention concerne aussi une composition pharmaceutique selon 

l'invention pour la prevention ou le traitement d'une infection par une bacterie 
appartenant a Tespece Listeria monocytogenes ou par un micro-organisme associe. 
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L'invention vise en outre une composition immunogene et/ou vaccinale, 
caracterisee en ce qu'elle comprend un ou plusieurs polypeptides selon Tinvention 
et/ou un ou plusieurs polypeptides hybrides selon T invention. 

L'invention comprend aussi Fertilisation d'une cellule transformee selon 
5 l'invention, pour la preparation d'une composition vaccinale. 

L'invention vise egalement une composition vaccinale, caracterisee en ce 
qu'elle contient une sequence nucleotidique selon l'invention, un vecteur selon 
l'invention et/ou une cellule transformee selon l'invention. 

L'invention concerne egalement les compositions vaccinates selon 
10 l'invention, pour la prevention ou le traitement d'une infection par une bacterie 
appartenant a Fespece Listeria monocytogenes ou par un micro-organisme associe. 

De maniere preferee, les compositions immunogenes et/ou vaccinates selon 
l'invention destinees a la prevention et/ou au traitement ^infection par Listeria 
monocytogenes ou par un micro-organisme associe seront choisies parmi les 
1 5 compositions immunogenes et/ou vaccinales comprenant un polypeptide ou un de 
ses fragments correspondant a une proteine, ou un de ses fragments, de 1'enveloppe 
cellulaire de Listeria monocytogenes. Les compositions vaccinales comprenant des 
sequences nucleotidiques comprendront de preference egalement des sequences 
nucleotidiques codant pour un polypeptide ou un de ses fragments correspondant a 
20 une proteine, ou un de ses fragments, de 1'enveloppe cellulaire de Listeria 
monocytogenes. 

Parmi ces compositions immunogenes et/ou vaccinales preferees, les plus 
preferees sont celles comprenant un polypeptide ou un de ses fragments, ou une 
sequence nucleotidique ou un de ses fragments dont les sequences sont choisies 
25 parmi les sequences nucleotidiques ou d'acides amines identifiees dans ce groupe 
fonctionnel et listees precedemment. 

Les polypeptides de l'invention ou leurs fragments entrant dans les 
compositions immunogenes selon l'invention peuvent etre selectionnes par des 
30 techniques connues de l'homme de Fart comme par exemple sur la capacite desdits 
polypeptides a stimuler les cellules T, qui se traduit par exemple par leur 
proliferation ou la secretion d'interleukines, et qui aboutit a la production 
d'anticorps diriges contre lesdits polypeptides. 
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Chez la souris, chez laquelle une dose ponderale de la composition vaccinate 
comparable a la dose utilisee chez Fhomme est administree, la reaction anticorps est 
testee par prelevement du serum suivi d'une etude de la formation d'un complexe 
entre les anticorps presents dans le serum et Fantigene de la composition vaccinale, 
5 selon les techniques usuelles. 

Selon F invention, lesdites compositions vaccinales seront de preference en 
association avec un vehicule pharmaceutiquement acceptable et, le cas echeant, 
avec un ou plusieurs adjuvants de Fimmunite appropries. 

Aujourd'hui, divers types de vaccins sont disponibles pour proteger 

10 rhomme contre des maladies infectieuses : micro-organismes vivants attenues (M 
bovis - BCG pour la tuberculose), micro-organismes inactives (virus de la grippe), 
des extraits acellulaires (Bordetella pertussis pour la coqueluche), proteines 
recombinees (antigene de surface du virus de 1' hepatite B), des polyosides 
(pneumocoques), Des vaccins prepares a partir de peptides de synthese ou de micro- 

1 5 organismes genetiquement modifies exprimant des antigenes heterologues sont en 
cours d' experimentation. Plus recemment encore, des ADNs plasmidiques 
recombines portant des genes codant pour des antigenes protecteurs ont ete 
proposes comme strategie vaccinale alternative. Ce type de vaccination est realise 
avec un plasmide particulier derivant d 5 un plasmide de E. coli qui ne se replique pas 

20 in vivo et qui code uniquement pour la proteine vaccinante. Des animaux ont ete 
immunises en injectant simplement FADN plasmidique nu dans le muscle. Cette 
technique conduit a V expression de la proteine vaccinale in situ et a une reponse 
immunitaire de type cellulaire (CTL) et de type humoral (anticorps). Cette double 
induction de la reponse immunitaire est Fun des principaux avantages de la 

25 technique de vaccination avec de FADN nu. 

Les compositions vaccinales comprenant des sequences nucleotidiques ou 
des vecteurs dans lesquels sont inserees lesdites sequences, sont notamment decrites 
dans la demande internationale N° WO 90/11092 et egalement dans la demande 
internationale N° WO 95/1 1307. 

30 La sequence nucleotidique constitutive de la composition vaccinale selon 

F invention peut etre injectee a Fhote apres avoir ete couplee a des composes qui 
favorisent la penetration de ce polynucleotide a Finterieur de la cellule ou son 
transport jusqu'au noyau cellulaire. Les conjugues resultants peuvent etre 
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encapsules dans des microparticules polymeres, comme decrit dans la demande 
internationale N° WO 94/27238 (Medisorb Technologies International). 

Selon un autre mode de realisation de la composition vaccinale selon 
T invention, la sequence nucleotidique, de preference un ADN, est complexee avec 
5 du DEAE-dextran, avec des proteines nucleaires, avec des lipides ou encapsulee 
dans des liposomes ou encore introduite sous la forme d 7 un gel facilitant sa 
transfection dans les cellules. Le polynucleotide ou le vecteur selon l 5 invention peut 
aussi etre en suspension dans une solution tampon ou etre associe a des liposomes. 

Avantageusement, un tel vaccin sera prepare conformement a la technique 

10 decrite par Tacson et al. ou Huygen et al. en 1996 ou encore conformement a la 
technique decrite par Davis et al. dans la demande internationale N° WO 95/1 1307. 

Un tel vaccin peut etre egalement prepare sous la forme d'une composition 
contenant un vecteur selon F invention, placee sous le controle d' elements de 
regulation permettant son expression chez l'homme ou I'animal. On pourra par 

15 exemple utiliser, en tant que vecteur d' expression in vivo de l'antigene 
polypeptidique d'interet, le plasmide pcDNA3 ou le plasmide pcDNAl/neo, tous les 
deux commercialises par Invitrogen (R&D Systems, Abingdon, Royaume-Uni). 
Un tel vaccin comprendra avantageusement, outre le vecteur recombinant, une 
solution saline, par exemple une solution de chlorure de sodium. 

20 On entend designer par vehicule pharmaceutiquement acceptable, un 

compose ou une combinaison de composes entrant dans une composition 
pharmaceutique ou vaccinale ne provoquant pas de reactions secondaires et qui 
permet par exemple la facilitation de l'administration du compose actif, 
T augmentation de sa duree de vie et/ou de son efficacite dans l'organisme, 

25 Paugmentation de sa solubilite en solution ou encore Amelioration de sa 
conservation. Ces vehicules pharmaceutiquement acceptables sont bien connus et 
seront adaptes par 1'homme de Tart en fonction de la nature et du mode 
d ! administration du compose actif choisi. 

En ce qui concerne les formulations vaccinates, celles-ci peuvent 

30 comprendre des adjuvants de Timmunite appropries qui sont connus de Thomme de 
Tart, comme par exemple Thydroxyde d'aluminium, un representant de la famille 
des muramyl peptides comme un des derives peptidiques du N-acetyl-muramyl, un 
lysat bacterien, ou encore Padjuvant incomplet de Freund. 
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De preference, ces composes seront administres par voie systemique, en 
particulier par voie intraveineuse, par voie intramusculaire, intradermique ou sous- 
cutanee, ou par voie orale. De maniere plus preferee, la composition vaccinale 
comprenant des polypeptides selon P invention, sera administree a plusieurs 
5 reprises, de maniere etalee dans le temps, par voie intradermique ou sous-cutanee. 

Leurs modes d'administration, posologies et formes galeniques optimaux 
peuvent etre determines selon les criteres generalement pris en compte dans 
Petablissement d'un traitement adapte a un patient comme par exemple Page ou le 
poids corporel du patient, la gravite de son etat general, la tolerance au traitement et 
1 0 les efFets secondaires constates. 

L'invention comprend Putilisation d'une composition selon Pinvention, 
pour le traitement ou la prevention de maladies genitales, induites ou aggravees par 
Listeria monocytogenes. 

Enfin, Pinvention comprend Putilisation d'une composition selon 
15 Pinvention, pour le traitement ou la prevention de maladiesinduites ou aggravees 
par la presence de Listeria monocytogenes. 

Enfin, Pinvention comprend Putilisation d'une composition selon 
l'invention, pour le traitement ou la prevention de maladies systemiques, 
notamment du systeme lymphatique, induites ou aggravees par la presence de 
20 Listeria monocytogenes. 

Par ailleurs, la presente invention a egalement pour objet une banque 
d'ADN genomique d'une bacterie du genre Listeria, de maniere preferee, Listeria 
monocytogenes, de maniere preferee la souche EGD-e, ladite banque d'ADN etant 
clonee dans des chromosomes artificiels de bacteries (BAG). Une telle banque 
25 d'ADN genomique contient de tres larges inserts du genome de Listeria, en 
particulier des inserts d'une taille comprise entre 50 et 200 kb. 

Un des avantages de Putilisation du systeme de BAG par rapport a un 
systeme de cosmides est que le plasmide utilise n'est present qu'a une ou au 
maximum deux copies par cellule transformee, ce qui reduit le potentiel pour la 
30 recombinaison entre des fragments d'ADN, et de fa9on plus importante, ce qui 
elimine le risque de surexpression letale de genes clones bacteriens. Neanmoins, la 
presence du BAG en tant que simple copie signifie que PADN plasmidique doit etre 
extrait d'un large volume de culture afin d'obtenir suffisamment d'ADN pour la 
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sequence. De plus, la stabilite et la fidelite de maintenance des clones dans une 
banque de BAC permettent F identification de differences genomiques parmi 
difFerentes souches de listeria, et F identification de ces differences genetiques qui 
peuvent etre responsables des variations phenotypiques observees entre les 

5 difFerentes souches. 

La banque d'ADN genomique decrite dans la presente invention, en 
particulier la banque LM baclim deposee a la CNCM le 11 avril 2000 sous le 
numero d'ordre n°I-2439 recouvre en effet le genome de Listeria monocytogenes. 
Toutefois, bien que certaines regions n'aient pas pu etre donees dans ladite banque, 

0 en raison de problemes de letalites chez Escherichia coli, ces regions peuvent 
facilement etre amplifiees et identifies par Fhornme du metier, en utilisant des 
oligonucleotides specifiques des sequences des extremites des differents clones qui 
forment les contigs. 

La presente invention concerne egalement les methodes pour Fisolement 

5 d'un polynucleotide d'interet present chez une souche de Listeria et absente chez 
une autre souche, qui utilise au moins une banque d'ADN basee sur un BAC, 
contenant le genome de Listeria. La methode selon Finvention pour Fisolement 
d'un polynucleotide d'interet peut comprendre les etapes suivantes : 

a) isoler au moins un polynucleotide contenu dans un clone de la banque 
d'ADN basee sur un BAC, d'origine de listeria, 

b) isoler : 

- au moins un polynucleotide genomique ou ADNc d'une listeria, ladite 
listeria appartenant a une souche differente de la souche utilisee pour la 
construction de la banque d'ADN BAC de Fetape a) ou, de fa?on 
alternative, 

- au moins un polynucleotide contenu dans un clone d'une banque d'ADN 
basee sur un BAC prepare a partir du genome d'une listeria qui est 
differente de la listeria utilisee pour la construction de la banque d'ADN 
basee sur le BAC de Fetape a). 

c) hybrider le polynucleotide de Fetape a) au polynucleotide de Fetape b) ; 

d) selectionner les polynucleotides de Fetape a) qui n'ont pas forme de 
complexe d' hybridation avec les polynucleotides de Fetape b) ; 

e) caracteriser le polynucleotide selectionne. 
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On peut preparer le polynucleotide de l'etape a) par la digestion d'au moms 
un clone recombinant BAC avec une enzyme de restriction appropriee, et de fagon 
optionnelle, 1 5 amplification de Pinsert polynucleotide qui en resulte. 

Ainsi, la methode de Tinvention permet a Fhomrne du metier d'effectuer des 
5 etudes genomiques comparatives entre les differentes souches ou especes du genre 
listeria, par exemple entre les souches pathogeniques et leurs equivalant non 
pathogenes. 

En particulier, il est possible d'etudier et de determiner les regions de 
polymorphisme entre lesdites souches. 

10 

EXEMPLES 

Exemple 1 : Fabrication de la banque BAC d'ADN de Listeria monocytogenes 

Les blocs contenant FADN chromosomique de Listeria monocytogenes, 
d'un poids moyen de 80 mg ont ete prepares dans P agarose, par des methodes 
15 connues de Thomme du metier. lis ont ete gardes dans une solution de 500 mM 
EDTA, pH8,0. 

On utilise 8 blocs pour la construction de la banque. Ces blocs d'agarose 
sont laves deux fois pendant 30 min sur de la glace dans du TE/PMSF (10 mM Tris- 
HC1, pH8,0 ; 1 mM EDTA ; 100 )llM PMSF). Puis, on lave les blocs d'agarose deux 

20 fois pendant 30 min sur de la glace dans du TE (10 mM Tris-HCI, pH8,0 ; 1 mM 
EDTA). Chaque bloc d'agarose est coupe en huit fines tranches, et seize tranches 
sont mises ensemble dans un seul tube. Les blocs sont alors incubes dans une 
solution de pre-incubation (20 jliM Spermidine, 5 mM DTT, tampon de restriction 
lx), pendant 40 min sur de la glace. On change le tampon de pre-incubation une 

25 fois et on Pincube de nouveau pendant 40 min. 

Les digestions partielles sont effectuees dans un tampon de digestion (2 mM 
Spermidine, 0,5 mM DTT, 0,02 \ig BSA, tampon de restriction 0,5x) pendant 30 
min a 4°C, sans enzyme de restriction, puis on ajoute pendant 2 h a 37°C, 0,1 ; 
0,25 ; 0,4 ou 0,5 d'unite d'EcoRI (life Tech) par tube. La digestion partielle est 

30 arretee par remplacement du tampon de digestion par 200 \xl EDTA 0,5 M, les tubes 
etant places sur de la glace. 
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Les blocs d 5 agarose sont alors places sur un gel cf agarose 1 % SeaKem 
GTG (FMC), 0,5x tampon TBE. On effectue alors le gel d'electrophorese en 
champs pulse avec les conditions suivantes : 

temps de pulsation initial : 90 secondes, 
5 temps de pulsation final : 90 secondes, 

6 V/cm, 

angle inclus : 120 °, 
dureedegel : 16h, 12°C. 

On coupe la region comprise entre 50 et 200 kb a partir du gel, et on la 
10 separe en trois morceaux (50-100 kb, 100-150 kb, 150-200 kb). Le gel pour le 
clonage ne doit pas etre teinte avec le bromure d'etidium. 

Ces blocs d'agarose sont places dans un nouveau gel d'agarose 1 % SeaKem 
GTG (FMC), 0,5x tampon TBE. On effectue un nouveau gel d'electrophorese en 
champs pulse avec les conditions suivantes : 
1 5 temps initial de pulsation : 5 secondes, 

temps final de pulsation : 5 secondes, 
4 V/cm, 

angle inclus : 120°, 
dureedegel : 15 h, 12°C, 
20 On decoupe de nouveau, sans avoir teinte 1'ADN avec du bromure 

d'etidium, la region comprise entre 50 et 200 kb et on la separe en trois morceaux 
(50-100 kb, 100-150 kb, 150-200 kb). 

Les morceaux d'agarose sont coupes en petits morceaux d' environ 100 mg 

chacun. 

25 L' agarose est incube a 67°C pendant 10 min puis refroidi a 42°C, et on 

ajoute 1 \il de Beta-Agarase (FMC, lU/pl). On incube pendant 30 min a 42°C, et on 
effectue une denaturation de la Beta-Agarase apres digestion complete de Pagarose 
par incubation pendant 10 min a 67°C, puis incubation sur glace. 

150 ng de vecteurs d'ADN digeres par EcoRI et des phophoryles (CEP, 

30 Roche) sont utilises pour la construction de la banque de BAC. On utilise le vecteur 
pBelaBAC-Kan (Mozo et al., Mol. Gen. Genet., 1998, 258, 562-70) pour la 
construction de ladite banque. On incube done 150 ng dudit vecteur avec 150 ng 
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d' inserts d'ADN, dans un tampon de ligation lx avec 5 unites de Ligase (USB 1 
unite/(Lil) pendant 16 h a 12°C. 

Le tampon de ligation est le tampon recommande par le fabricant. 
La transformation est effectuee en ajoutant 5 jixl de la reaction de ligation a 
5 40 pi de cellules electrocompetentes DH10B, et Telectroporation est effectuee dans 
un electroporateur Life Tech avec les conditions suivantes : 

330 jlxF, 4 kn, DC Volts :lowQ, taux de charge : rapide, cuvette 
d'electroporation 1,5 mm. 
On ajoute 1 mm de milieu de culture SOC aux cellules et on incube pendant 
10 45 min a 37°C. On etale ensuite les cellules sur une boite contenant du LB agar et 
qui contient egalement 30 g/ml de kanamicine, 100 pg/ml d'IPTG et 40 pg/ml d'X- 
Gal. On selectionne les cellules recombinantes qui sont blanches par rapport aux 
cellules bleues, et on caracterise la banque de BAC en effectuant des cartes de 
restriction de chacun des clones ainsi obtenus. 
15 Exemple 2 : Annotations et analyse de la sequence genomique de Listeria 
monocytogenes 

La bio-informatique a un role cle dans les trois phases d'un projet genome : 
suivi des inserts realises par la methode de sequengage aleatoire, phase de finition 
de la sequence genome, et annotations. Les Inventeurs ont developpe un logiciel 

20 complet qui permet de repondre a ces trois exigences : GMP-Tool-box (GMPTB). 

Suivi : Durant la procedure de sequengage aleatoire, GMPTB extrait des 
fichiers resultats (format Phrap) toutes les caracteristiques necessaires a 
Tassemblage (nombre de contigs, nombre de sequences...) et les affiche dans un 
tableau. Ce tableau peut etre utilise pour creer des graphiques qui montrent la 

25 progression de cette methode et qui permettent une identification rapide des 
problemes d'assemblage. De fagon importante, GMPTB permet la comparaison 
entre les resultats d' assemblage et cree une page HTML afin de decrire la relation 
entre les nouveaux et les anciens contigs (fusion, creation. . .). 

Phase de finition de la sequence : Differentes strategies sont utilisees par 

30 GMPTB pour predire les liaisons entre les contigs. GMPTB recherche en particulier 
tous les clones permettant les liaisons, sur la base de la localisation et de 
l'orientation des sequences terminates- II peut aussi indiquer les misassemblages. 
GMPTB peut aussi predire les liaisons, sur la base de comparaisons de genomes, en 
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recherchant les similarites entre Ies extremites des contigs et d'autres sequences 
genomiques (haut niveau nucleotidique et des acides amines). 

Annotation s : GMPTB permet de commencer F annotation durant la phase 
terminate. De fait, GMPTB cree un fichier individuel de proteines (IPF), pour 
5 chaque phase ouverte de lecture (ORF) lors de Fassemblage. Ce sont des fichiers 
texte, dans un format specifique qui content trois categories de champs : 

les champs minimums contiennent un numero d 5 identification, un 
numero de version, la localisation et les sequences. La sequence 
nucleotidique exportee correspond a la sequence de la phase ouverte de 
1 0 lecture avec 500 bases additionnelles avant le premier codon stop et 200 

bases additionnelles apres le second stop. 
- le champ automatique contient des resultats ajoutes par differents 
programmes a FIPF. II concerne la sequence d'ADN (recherche de sites 
de liaisons de ribosomes, promoteurs ou terminateurs, capacite 
1 5 codante. . . ) et la sequence proteique predite (homologie, domaine. . . ). 

le champ manuel contient des resultats et des commentaires ajoutes par 
les utilisateurs. Apres un nouvel assemblage, GMPTB extrait toutes les 
ORF et cree de nouveaux IPF selon les sequences IPF derives des 
assemblages precedents. GMPTB reconnatt les IPF modifies, qui sont 
20 les seuls utilises pour une nouvelle analyse automatique apres chaque 

assemblage. 

La specificite de cette strategic est que F annotation des EPF est independante 
de Fetape de Fassemblage, a moins que sa sequence ne soit modifiee. Les IPF sont 
relies a une banque de donnee genomique Sybases (du modele SubtiList) et sont 
25 accessibles grace a F intermediate d'un serveur web. lis peuvent etre modifies et 
annotes par differents Inventeurs durant la phase de projet de genome. 
De pot de materiel biologique 

Les organismes suivants ont ete deposes le 11 Avril 2000 a la Collection 
Nationale de Cultures de Microorganismes (CNCM), 25 rue du Docteur Roux, 
30 75724 Paris Cedex 15, France, selon les dispositions du Traite de Budapest : 

- Souche de Listeria monocytogenes EGD-e, Numero d'ordre 1-2440 ; 

- Banque BAC d'ADN de Listeria (145 clones), LM-baclib, Numero d'ordre I- 
2439. Ladite banque BAC (1-2439) a ete realisee dans la souche E.coli DH10B 
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(Grant et al., PNAS, 87, 4645, 1990), construite apres digestion partielle de 
PADN de Listeria monocytogenes par P enzyme EcoRI dans le vecteur 
pBelaBAC-Kan (Mozo et al, Mol. Gen. Genet., 1998, 258, 562-70). 

TABLEAU 1 



SeqID 



Norn Localisationde la sequence 
nucleique de I'ORF sur la 
SEQ ID N° 1 



SeqID n° 2 LM- 1000.1 From 589066 to 589362 

SeqID n° 3 LM-1 002.1 From 587012 to 589033 

SeqID n° 4 LM-1050.1 From 2907153 to 2909708 

SeqID n° 5 LM-1 179.1 From 761552 to 763468 

SeqID n° 6 LM-1 18.2 From 2787416 to 2788363 

SeqID n° 7 LM-1235.1 From 875721 to 881855 

SeqID n° 8 LM-1248.1 From 865530 From 865530 

SeqID n° 9 LM-1248.1 From 865530 From 865530 

SeqID n° 10 LM-1305.1 From 919020 to 920408 

SeqID n° 1 1 LM-1490.1 From 649864 to 651633 

SeqID n° 12 LM-1514.1 From 664242 to 668990 



SeqID n° 13 LM-1660.3 From 501312 to 501617 
SeqID n° 14 LM-1 738.1 From 547520 to 549337 

SeqID n° 15 LM-1756.2 From 2679599 to 2681125 



SeqID n° 16 LM-1 778.1 From 1442368 to 1443687 



SeqID n° 17 LM-1972.3 From 1313654 to 1315435 



Fonction 



Unknown, pepdidoglycan 
bound protein (LPXTG 
motif) 
Unknown, similar to 
internalin protein 
Unknown, LPXTG protein 
with LRR repeats 
Unknown, putative 
peptidoglycan bound 
protein (LPXTG motif) 
Unknown, peptidoglycan 
anchored protein (LPXTG 
motif) 

Unknown, surface protein 
(LPXTG motif) 

unknown, surface protein 
(LPXTG motif) 

unknown, surface protein 
(LPXTG motif) 

Unknown, similar to wail 

associated protein 
precursor (LPXTG motif) 
Unknown, similar to 
internalin proteins 
Unknown, peptidoglycan 
bound protein (LPXTG 
motif) similar to adhesin 
Unknown 
Unknown, similar to 
internalin proteins 
Unknown, surface protein 
(GW repeat) similar to N- 
acetylmuramidase 
Unknown, putative 
peptidoglycan bound 
protein (LPXTG motif) 
Unknown, similar to 
internalin proteins 
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SeqID n° 18 
SeqID n° 19 
SeqID n° 20 
SeqID n°21 
SeqID n° 22 

SeqID n° 23 

SeqID n° 24 

SeqID n° 25 

SeqID n° 26 

SeqID n° 27 

SeqID n° 28 

SeqID n° 29 

SeqID n° 30 
SeqID n° 31 
SeqID n° 32 

SeqID n° 33 

SeqID n° 34 

SeqID n° 35 
SeqID n° 36 



LM-1 974.3 
LM-2 137.2 
LM-229.1 
LM-2323.1 
LM-2435.1 

LM-2438.1 

LM-2503.1 

LM-2504.1 

LM-3009.3 

LM-311.2 

LM-3144.1 

LM-3369.1 



LM-3752.3 
LM-757.1 



48 

From 1315767 
From 360936 

From 1170002 
From 344850 
From 159663 

From 157089 

From 2108500 

From 2106329 

From 1149887 

From 131419 

From 351459 

From 1869532 



From 357775 
From 2544267 



to 1317563 
to 366272 
to 1171621 
to 346049 
to 161378 

to 159470 

to 2109603 

to 2108209 

to 1152475 

to 133773 

to 355505 

to 1872243 

to 1722328 
to 830108 
to 1107759 

to 2164011 

to 2657803 

to 359676 
to 2545433 



LM-3418.2 From 1717193 

LM-3477.1 From 828168 

LM-3609. 1 From 1 1 06041 

LM-3691.2 From 2162323 

LM-3700.2 From 2653211 



Unknown, similar to 
internalin proteins 
Unknown, similar to 
internalin proteins 
Unknown, putative 
interanlin, similar to InIA 
Unknown, similar to 
surface proteins 
unknwon, surface 
anchored protein (LPXTG 
motif) 
unknwon, surface 
anchored protein (LPXTG 
motif) 

Unknown, putative cell 
surface protein, similar to 
internalin proteins 
Unknown, putative 
peptidoglycan bound 
protein (LPXTG motif) 
unknown, similar to 
fibrinogen-binding protein 
(LPXTG motif) 
Unknwon, similar to 5 - 
nucleotidase (LPXTG 
motif) 

Unknown, similar to cell 
surface proteins (LPXTG 
motif) 
Unknown, putative 
peptidoglycan bound 
protein (LPXTG motif) 
unknown, peptidoglycan 
linked protein (LPxTG) 
Unknown, similar to 

internalin 
unknown, similar to 
AUTOLYSIN (EC 3.5.1.28) 
(N-ACETYLMURAMOYL- 
L-ALANINE AMIDASE) 

Unknown, putative 
peptidoglycan bound 
protein (LPXTG motif) 
Unknwon, peptidoglycan 
anchored protein (LPXTG 
motif) 
Unknown, similar to 

internalin 
Unknown, similar to 
internalin proteins 
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SeqID n° 37 


LM-814.2 


From 2264772 


to 2268230 


Unknown, putative 
peptidoglycan bound 
protein (LPXTG motif) 


SeqID n 38 


LM-816.2 


From 2259753 


to 2264591 


Unknown, putative 
peptidoglycan bound 
protein (LPXTG motif) 


SeqID n° 39 


LM-894.1 


From 2469093 


to 2471915 


Unknown, similar to 
internalin proteins 


SeqID n 40 


LM-966.1 


From 173309 


to 1 74556 


Unknown, cell wall 
anchored protein 


oeqiu n 41 


LM-y/o.l 


From 169510 


to 172008 


Unknwon, similar to 
internalin proteins 


SeqID n° 42 


LM-133.1 


From 1230773 


to 1232308 


Unknown, similar to cobyric 
acid synthase CbiP 


SeqID n° 43 


LM-134.1 


From 1229967 


to 1230773 


Unknown, similar to cobalt 
transport ATP-binding 
protein CbiO 


SeqID n° 44 


LM-135.1 


From 1229277 


to 1229954 


Unknown, similar to cobalt 
transport protein Q 


SeqID n° 45 


LM-136.1 


From 1228994 


to 1229290 


Unknown, similar to 
putative cobalt transport 
protein CbiN 


SeqID n° 46 


LM-137.1 


From 1228263 


to 1228997 


Unknown, similar to 
cobalamin biosynthesis 
protein M 


SeqID n° 47 


LM-138.1 


From 1227556 


to 1228266 


Unknown, similar to S- 
adenosyl-methionine: 
precorrin-2 
methyltransferase 


SeqID n° 48 


LM-139.1 


From 1226778 


to 1227563 


unknown, similar to 
anaerobic Cobalt 
Chelatase In Cobalamin 
Biosynthesis 


Origin «o A C\ 

beqiu n 49 


1 R ft AAA A 

LM-141,1 


From 1225300 


to 1226781 


Unknown, similar 
uroporphyrinogen-lll 
methyltransferase/uroporph 
yrinogen-lll synthase 


SeqID n° 50 


LM-142.1 


From 1224548 


to 1225300 


Unknown, similar to 
cobalamin biosynthesis J 
protein CbiJ 


oeqiu n ol 


1 KA *i AO a 

LM-14o.1 


From 122oo2o 


to 1224551 


Unknown, similar to 
precorrin methylase 


oeqiLJ n 


LIVI- i44. 1 


( ' . t-vi H 0007QQ 

rrom \z.zz/z)o 


to 122oo29 


Unknown, similar to 
cobalamin biosynthesis 
protein G CbiG 


SeqID n° 53 


LM-145.1 


From 1222062 


to 1222811 


Unknown, similar to 
precorrin-3 methyiase 


SeqID n° 54 


LM-146.1 


From 1221487 


to 1222056 


Unknown, similar to 
precorrin decarbocylase 


SeqID n° 55 


LM-147.1 


From 1220901 


to 1221497 


Unknown, similar to 
precorrin methylase 
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SeqID n° 56 LM-149.1 From 1219783 to 1220904 



SeqfD n° 57 
SeqID n° 58 



SeqID n° 59 
SeqID n° 60 



LM-150.1 From 1219135 to 1219767 
LM-151.1 From 1218175 to 1219122 



LM-152.1 From 1216963 to 1217322 
LM-181.1 From 1196097 to 1197182 



SeqID n° 61 LM-207.1 From 1180152 to 1181036 

SeqID n° 62 LM-209.1 From 1 179168 to 1 179743 

SeqID n° 63 LM-210.1 From 1178421 to 1179167 

SeqID n° 64 LM-212.1 From 1177862 to 1178419 



SeqID n° 65 LM-1.3 From 2710869 to 2712755 

SeqID n° 66 LM-10.1 From 2718312 to 2719055 

SeqID n° 67 LM-100.1 From 2775644 to 2776486 

SeqID n° 68 LM-1003.1 From 586329 to 586994 

SeqID n° 69 LM-1 004.1 From 585015 to 585962 



SeqID n° 70 LM-1 005.1 From 583507 to 584757 



SeqID n° 71 LM-1007.1 From 583072 to 583452 
SeqID n° 72 LM-1008.1 From 582526 to 583047 



SeqID n° 73 LM-1009.1 From 581519 to 582505 



SeqID n° 74 LM-1 01 . 1 From 2776532 to 2776777 



Unknown, similar to 
cobalamin biosynthesis 
protein CbiD 
Unknown, similar to 
precorrin isomerase 
Unknown, similar to 
cobalamine synthesis 
protein CbiB 
FALSE ORF 
unknown, similar to 
Salmonella typhimurium 
CobD protein and to 
histidinol-phosphate 
aminotransferase 
Regulatory protein similar 
to Salmonella typhimurium 

PocR protein 
unknown, similar to alpha- 
ribazole-5 -phosphatase 
unknown, highly similar to 

cobalamin (5 - 
phosphatase) synthetase 
unknown, similar to 
bifunctional cobalamin 
biosynthesis protein CopB, 

(cobinamide kinase; 
cobinamide phosphatase 

guanylyltransferase) 
Unknown, similar to NADH 
dehydrogenase 
Unknown 
Unknown, similar to 
aldo/keto reductase 
Unknown 
Unknown, similar to B. 

subtilis DeoR 
transcriptional regulator 
Unknown, similar to 
putative NAD(P)- 
dependent oxidoreductase 

Unknown 
Unknown, similar to PTS 
system, glucitol/sorbitol- 
specific enzyme II Cll 
component 
Unknown, similar to PTS 
system, glucitol/sorbitol- 
specific enzyme IIBC 

component 
unknown, similar to B. 
subtilis YaaL protein 
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SeqID n° 75 LM-1010.1 From 581150 to 581500 



SeqiD n° 76 LM-1 01 1.1 From 580119 to 581039 



SeqiD n° 77 
SeqID n° 78 

SeqID n° 79 

SeqID n° 80 

SeqID n° 81 
SeqID n° 82 
SeqID n° 83 

SeqID n° 84 

SeqID n° 85 

SeqID n° 86 
SeqID n° 87 
SeqID n° 88 
SeqID n° 89 



LM-1012.1 From 578886 to 580079 

LM-1 01 3. 1 From 577632 to 578648 

LM-1015.1 From 576403 to 577584 

LM-1017.1 From 575166 to 576437 

LM-1 01 8.1 From 573740 to 575062 

LM-1 02. 1 From 2776792 to 2777388 

LM-1 020.1 From 572600 to 573568 

LM-1 022.1 From 571204 to 572559 

LM-1 023.1 From 570916 to 571 185 

LM-1 024. 1 From 570008 to 570838 

LM-1 025. 1 From 569094 to 569948 

LM-1029.1 From 567030 to 569081 

LM-103.1 From 2777497 to 2777814 



SeqID n° 90 LM-1031.1 From 565764 to 567014 



SeqID n° 91 
SeqID n° 92 
SeqID n° 93 



SeqID n° 94 
SeqID n° 95 
SeqID n° 96 



LM-1 032.1 From 564257 to 565753 
LM-1033.1 From 562789 to 564264 
LM-1 036.1 From 561819 to 562559 



LM-1 037. 1 From 560443 to 561 774 
LM-1040.2 From 2915744 to 2916097 
LM-1041.2 From 2915219 to 2915641 



Unknown, similar to PTS 
system, glucitol/sorbitol- 
specific enzyme IIA 
component 
Unknown, similar to ABC 
transporter (binding 

protein) 
Unknown, similar to 
penicillin-binding protein 

Unknown, similar to 
tagatose-1 ,6-diphosphate 

aldolase 
Unknown, similar to N-acyl- 
L-amino acid 
amidohydrolase 
Unknown, similar to N- 
carbamyl-L-amino acid 

amidohydrolase 
Unknown, similar to 6~ 
phospho-beta-glucosidase 
unknown, highly similar to 
recombination protein recR 

Unknown, similar to 
transcription regulator (Lacl 
family) 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown 
Unknown 
Unknown 
Unknown, highly similar to 
B. subtilis YaaK protein 
Unknown, conserved 
hypothetical protein similar 

to putative 
glucosaminyltransferase 
Unknown, hypothetical 
secreted protein 
Unknown, transmembrane 
protein 
Unknown, similar to 
transcription regulator 
(TipA from Streptomyces 
coelicolor) 
Unknown 
Unknown 
Unknown similar to 
transcriptional regulator 
(MarR family) 
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SeqID n° 97 


LM-1 043.1 


From 2914057 


to 2915205 


SeqID n° 98 


LM-1 045.1 


From 2912595 


to 2913686 


SeqID n° 99 


LM-1 047.1 


From 2911415 


to 2912602 


SeqID n° 100 


LM-1 048.1 


From 2910085 


to 291 1326 


SeqID n° 101 
oeqiu n iuz 


LM-1 049.1 


From 2909753 
hrom 2904598 


to 2910076 
to 2906940 


SeqID n° 103 


LM-1 054.1 


From 2903200 


to 2904402 


SeqID n° 104 


LM-1 055.1 


From 2901800 


to 2903200 


SeqID n° 105 


LM-1 056.1 


From 2900594 


to 2901784 


SeqID n° 106 


LM-1 057.1 


From 2899313 


to 2900506 


oeqiu n iu/ 


1 l\JI AClCO A 


From 2898535 


to 2899266 


oeqiu n lUo 


i kji a ncn a 
LM-1Uoy.1 


From 2897847 


to 2898374 


oeqiu n iuy 
SeqID n° 110 


t ti A A nan a 

LM-1 061.1 


From 28971 16 
From 2896241 


to 2897823 
to 2897059 


SeqID n° 111 


LM-1 064.1 


From 2894510 


to 2895883 


SeqID n° 112 


LM-1 066.1 


From 2892596 


to 2894485 


SeqID n° 113 


LM-1 067.2 


From 2892057 


to 2892437 


SeqID n° 114 
SeqID n° 115 


LM-1 069.3 
LM-1 07.1 


From 434945 
From 2777844 


to 435817 
to 2779583 


SeqID n° 116 


LM-1 070.1 


From 433153 


to 434745 


SeqID n° 117 
SeqID n° 118 
SeqID n° 119 
SeqID n° 120 


LM-1 074.1 
LM-1075.1 
LM-1 076.1 
LM-1 077.1 


From 429630 
From 428981 
From 428576 
From 428127 


to 432095 
to 429403 
to 428968 
to 428507 


SeqID n° 121 


LM-1 079.1 


From 427104 


to 4281 11 



Unknown, similar to efflux 

proteins 
unknown, highly similar to 
phosphoserine 
aminotransferase 
unknown, similar to D-3- 
phosphoglycerate 
dehydrogenase 
unknown, similar to an 
hypothetical protein from 
Thermotoga maritima 
unknown 
Unknown, amino-terminal 
domain similar to 
transcription regulators 
unknown, similar to 
carboxypeptidase 
Unknown, similar to 
transmembrane efflux 
protein 
unknown, similar to 

peptidases 
unknown, similar to 

transport protein 
Unknown, similar to 

reductases 
Unknown, similar to 
transcriptional regulator 

Unknown 
unknown, similar to D- 
alanyl-D-alanine 
carboxypeptidase 
Unknown, similar to 
GTPase 
unknown, highly similar to 
GidA protein 
Unknown, hypothetical 
secreted protein 
Unknown 
unknown, highly similar to 

DNA polymerase ill 
(gamma and tau subunits) 
Unknown, similar to 
phosphoenolpyruvate 
synthase (N-terminal part) 
internalin 
Unknown 
Unknown 
Unknown, similar to B. 
subtilis YyaH protein 
Unknown, similar to 
phosphate transport protein 
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SeqID 


n° 


122 


LM-108.1 


From 2779910 


to 2780374 


Unknown 


SeqID 


n° 


123 


LM-1 080.1 


From 426597 


to 426995 


Unknown 


SeqID 


n° 


124 


LM-1081.1 


From 426190 


to 426606 


Unknown 


SeqID 


n° 


125 


LM-1 082.1 


From 424130 


to 426064 


Unknown, similar to 
transcriptional 
antiterminator (BgIG family) 


SeqID 


n° 


126 


LM-1 083.1 


From 421469 


to 424096 


Unknown, highly similar to 

E. col YbgG protein, a 
putative sugar hydrolase 


SeqID 


n° 


127 


LM-1084.1 


From 420335 


to 421447 


Unknown, similar to 

fructose-specific 
phosphotransferase 
enzyme IIC 


SeqID 


n° 


128 


LM-1085.1 


From 419999 


to 420331 


Unknown, similar to 

fructose-specific 
phosphotransferase 
enzyme IIB 


SeqID 


n° 


129 


LM-1 088.1 


From 419544 


to 420002 


Unknown, similar to 
phosphotransferase 
system enzyme HA 


SeqID 


n° 


130 


LM-1 089.1 


From 418891 


to 419373 


Unknown, similar to 
unknown proteins 


SeqID 


n° 


131 


LM-1 09.1 


From 2780933 


to 2782297 


Unknown, similar to PTS 
system, cellobiose-specific 
enzyme IIC 


SeqID 


n° 


132 


LM-1 091.1 


From 417963 


to 418763 


Unknown, similar to 1- 
pyrroline-5-carboxylate 
reductase (ProC) 


SeqID 


n° 


133 


LM-1 092.1 


From 417535 


to 417960 


Unknown, weakly similar to 
blasticidin S- 
acetyltransferase 


SeqID 


n° 


134 


LM-1 093.1 


From 416796 


to 417479 


Unknown, similar to L. 

monocytogenes 
extracellular P60 protein 


SeqID 


n° 


135 


LM-1 094.1 


From 416203 


to 416652 


Unknown 


SeqID 


n° 


136 


LM-1 097.1 


From 415224 


to 416168 


Unknown, highly similar to 
B. subtilis YqfA protein 


SeqID 


n° 


137 


LM-1 098.1 


From 414928 


to 415227 


Unknown 


SeqID 


n° 


138 


LM-1 099.1 


From 414095 


to 414760 


Unknown, similar to uracil- 
DNA glycosylase 


SeqID 


n° 


139 


LM-1 1.1 


From 2719056 


to 2719775 


Unknown 


SeqID 


n° 


140 


LM-1 100.1 


From 412849 


to 413964 


Unknown, low temperature 
requirement protein A 


SeqID 


n° 


141 


LM-1 101.1 


From 412457 


to 412825 


Unknown 


SeqID 


n° 


142 


LM-1 102.1 


From 411992 


to 412363 


Unknown, similar to B. 
subtilis YhdG protein 


SeqID 


n° 


143 


LM-1 105.1 


From 409944 


to 411860 


Unknown, similar to B. 
subtilis lolD protein, to 
acetolactate synthase 


SeqID 


n° 


144 


LM-1 107.1 


From 408954 


to 409931 


Unknown, similar to B. 
subtilis lolC protein and to 
fructokinase 



WO 01/77335 



PCT/FR01/01118 



54 



SeqID n° 


145 


LM-1 108.1 


From 408116 


to 408937 


SeqID n° 


146 


LM-1 110.2 


From 406637 


to 408103 


SeqID n° 


147 


LM-1 112.2 


From 405680 


to 406441 


SeqID n° 


148 


LM-1 114.1 


From 405221 


to 405607 


SeqID n° 


149 


LM-1 115.3 


From 404365 


to 404994 


SeqID n° 


150 


LM-1 116.4 


From 807976 


to 810792 


SeqID n° 


151 


LM-1 117.1 


From 807255 


to 807689 


SeqID n° 


152 


LM-1 118.1 


From 806770 


to 807255 


SeqID n° 


153 


LM-1 12.1 


From 2783466 


to 2784107 


SeqID n° 


154 


LM-1 121.1 


From 805784 


to 806596 


SeqID n° 


155 


LM-1 122.1 


From 804887 


to 805765 


Rpnin n° 

OC^J | \-J 1 1 


15R 


I M-1 19"^ 1 


PI UNI OU*tOO^ 


CO OUH- / 


SeqID n° 


157 


LM-1 125.1 


From 803745 


to 804212 




I JO 


LIVI- I liCO. I 


rrorn ouozu i 


IO C3U004o 


SeqID n° 


159 


LM-1 127.1 


From 802356 


to 802739 






I— |V|- I I 


riuiTi ou looo 


IO oUZ/UZ 


SeqID n° 


161 


LM-1 129.1 


From 800965 


to 801297 


SeqID n° 


162 


LM-1 13.1 


From 2784354 


to 2784674 


SeqID n° 


163 


LM-1 130.1 


From 799996 


to 800925 


SeqID n° 


164 


LM-1131.1 


From 798828 


to 799817 


SeqID n° 


165 


LM-1 132.1 


From 798141 


to 798788 


SeqID n° 


166 


LM-1 133.1 


From 797353 


to 798105 



Unknown, similar to B. 
subtilis loiB protein 
Unknown, highly similar to 
B. subtilis methylmalonate- 
semialdehyde 
dehydrogenase lolA 
Unknown, similar to B. 
subtilis transcription 
repressor of myo-inositol 
cataboiism operon loIR 
Unknown 
Unknown 
Unknown, similar to 
transcriptional regulator 
(NifA/NtrC family) 
Unknown, similar to 
mannose-specific 
phosphotransferase 
system (PTS) component 
HA 

Unknown, similar to 
mannose-specific 
phosphotransferase 
system (PTS) component 
MB 
Unknown 
Unknown, similar to 
mannose-specific 
phosphotransferase 
system (PTS) component 
IIC 

Unknown, similar to 
mannose-specific 
phosphotransferase 
system (PTS) component 
IID 
Unknown 
Unknown 
Unknown 
Unknown 
Unknown, similar to 
transcription regulator 
(repressor) 
Unknown 
unknown, similar to 
hypothetical proteins 
Unknown, conserved 
hypothetical protein 
Unknown, similar to alcohol 
dehydrogenase 
Unknown, similar to 
transcription regulator 
Unknown 
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SeqlDn°167 LM-1 134.1 From 796075 to 797193 



SeqlDn°168 LM-1 135.1 From 795034 

SeqlDn°169 LM-1 136.1 From 793777 

SeqlDn°170 LM-1137.1 From 792879 

SeqlDn°171 LM-1138.1 From 791977 



SeqID n° 172 
SeqID n° 173 

SeqID n° 174 

SeqID n° 175 

SeqID n° 176 

SeqID n° 177 

SeqID n° 178 
SeqID n° 179 
SeqID n° 180 
SeqID n° 181 



LM-1 139.1 
LM-1 14.1 

LM-1 140.1 

LM-1 141.1 

LM-1 142.1 

LM-1 143.1 

LM-1 144.1 
LM-1 146.1 
LM-1 147.1 
LM-1 149.1 



From 790662 
From 2784805 

From 789520 

From 788642 

From 787320 

From 786372 

From 785754 
From 784788 
From 783901 
From 782794 



to 796062 
to 795030 

to 793754 
to 792867 

to 791960 
to 2786319 

to 790509 

to 789514 

to 788576 

to 786992 

to 786362 
to 785747 
to 784788 
to 783774 



SeqID n° 182 LM-1151.1 From 781896 to 782801 

SeqID n° 183 LM-1152.1 From 781103 to 781924 

SeqID n° 184 LM-1 153.2 From 780377 to 780988 

SeqID n° 185 LM-1154.1 From 779616 to 780296 

SeqID n° 186 LM-1 155.1 From 778654 to 779490 



SeqID n° 187 
SeqID n° 188 
SeqID n° 189 
SeqID n° 190 
SeqID n° 191 
SeqID n° 192 



LM-1 156.1 
LM-1 157.1 
LM-1 158.1 
LM-1 159.1 
LM-1 160.1 
LM-1 161.1 



From 778262 
From 777692 
From 777209 
From 776761 
From 776398 
From 775744 



to 778558 
to 778207 
to 777496 
to 777102 
to 776697 
to 776247 



Unknown, similar to 
transcriptional regulator 
(Lacl family) 
Unknown, similar to alpha- 

1 ,6-mannanase 
Unknown, similar to sugar 

ABC transporter, 
periplasmic sugar-binding 

protein 
Unknown, similar to ABC 
transporter, permease 
protein 
Unknown, similar to 
putative sugar ABC 
transporter, permease 
protein 
Unknown 
Unknown, highly similar to 
gluconate kinase 
Unknown, similar to . 
lipoate-protein ligase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
ATP/GTP-binding protein 
Unknown, similar to 
unknown proteins 
Unknown 
Unknown 
Unknown 
Unknown, similar to 
hypothetical proteins 
Unknown; Similar to ABC 
transporter (ATP-binding 
protein) 
unknown, similar to 
unknown proteins 
Unknown, weakly similar to 
a bile acid 7-alpha 

dehydratase 
unknown, similar to 
transcription regulator 
Crp/Fnr family 
Unknown, weakly similar to 
a putative haloacetate 
dehalogenase 
Unknown 
Unknown 
Unknown 
unknown 
Unknown, hypothetical 
Unknown 
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SeqID n° 193 LM-1 164.1 From 773715 to 775715 



SeqID n° 194 LM-1 165.1 From 772996 to 773700 
SeqID n° 195 LM-1 166.1 From 772310 to 772999 



SeqID n° 196 LM-1 167.1 From 771949 to 772326 

SeqID n° 197 LM-1168.1 From 771026 to 771685 

SeqID n° 198 LM-1 169.1 From 769639 to 771012 

SeqID n° 199 LM-1 17.1 From 2786435 to 2787373 

SeqID n° 200 LM-1 171.1 From 767766 to 769619 



SeqID n° 201 LM-1 172.1 From 766810 to 767742 
SeqID n° 202 LM-1 173.1 From 766351 to 766797 

SeqID n° 203 LM-1 174.1 From 765683 to 766354 



SeqID n° 204 LM-1 175.1 From 764465 to 765469 



SeqID n° 205 LM-1 176.1 From 763798 to 764307 

SeqID n° 206 LM-1 181 .1 From 760455 to 760859 

SeqID n° 207 LM-1 182.2 From 759972 to 760310 

SeqID n° 208 LM-1 183.2 From 759479 to 759958 

SeqID n° 209 LM-1 184.3 From 2916291 to 2916890 



SeqID n°210 

SeqID n° 211 

SeqID n° 212 
SeqID n°213 

SeqID n° 214 

SeqID n° 215 

SeqID n°216 



LM-1 186.1 From 2916946 to 2917266 

LM-1 189.1 From 2917389 to 2918051 

LM-1 19.2 From 1239842 to 1240201 

LM-1 190.1 From 2918048 to 2919169 

LM-1 195.1 From 2919166 to 2921343 

LM-1 196.1 From 2921340 to 2922371 

LM-1 198.1 From 2922429 to 2923223 



Unknown, similar to ABC 
transporter, ATP-binding 
protein 
Unknown 
Unknown, similar to ABC 
transporter, ATP-binding 
protein 
Unknown, similar to 
transcriptional regulator 
(GntR family) 
Unknown, similar to 
putative transcription 

regulator 
Unknown, similar to 6- 
phospho-beta-glucosidase 
Unknown, secreted protein 
with 1 GW repeat 
Unknown, similar to 
phosphotransferase 
system (PTS) beta- 
glucoside-specific enzyme 
IIABC component 
Unknown 
Unknown, similar to ribose 
5-phosphate isomerase 

Unknown, similar to 
Ribulose-5-Phosphate 3- 
Epimerase 
Unknown, similar to 
transcriptional regulator 
(Lacl family) 
Unknown, similar to 
transcription regulator 
Unknown 
Unknown 
Unknown 
Unknown, similar to yeast 
protein Frm2p involved in 
fatty acid signaling 
Unknown, similar to 

thioredoxin 
Unknown, similar to 
phosphoglucomutase 

unknown 
unknown; similar to 
unknown proteins 
Unknown, similar to a 
maltose phosphorylase 
Unknown, similar to 
oxidoreductases 
Unknown, highly similar to 
an E. coli protein 



WO 01/77335 

SeqID n° 217 
SeqID n°218 

SeqID n°219 

SeqID n° 220 

SeqID n° 221 

SeqID n° 222 

SeqID n° 223 

SeqID n° 224 
SeqID n° 225 

SeqID n° 226 

SeqID n° 227 

SeqID n° 228 
SeqID n° 229 

SeqID n° 230 
SeqID n° 231 
SeqID n° 232 

SeqID n° 233 

SeqID n° 234 

SeqID n° 235 
SeqID n° 236 

SeqID n° 237 

SeqID n° 238 

SeqID n° 239 
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LM-1 199.1 From 2923236 to 2924288 Unknown, similar to alcohol 

dehydrogenase 

LM-1201.1 From 2924307 to 2925158 Unknown, similar to sugar 

ABC transporter permease 
protein 

LM-1202.1 From 2925145 to 2926026 Unknown, similar to sugar 

ABC transporter permease 
protein 

LM-1203.1 From 2926101 to 2927393 Unknown, similar to sugar 

binding protein (ABC 
transporter) 

LM-1205.1 From 2927390 to 2928631 Unknown, similar to 

Sucrose phosphorylase 

LM-1206.1 From 2928668 to 2929090 unknown, weakly similar to 

sucrose phosphorylase 

LM-1208.1 From 2929315 to 2930340 Unknown, similar to 

transcriptional regulator 

LM-1209.1 From 2930479 to 2932473 unknown 

LM-121.1 From 1239039 to 1239797 unknown, similar to rRNA 

methylase 

LM-1210.1 From 2932609 to 2933148 Unknown, similar to 

unknown proteins 

LM-1212.1 From 2933303 to 2934715 unknown, similar to 

transmembrane efflux 
proteins 

LM-1213.1 From 2934756 to 2935070 Unknown, highly similar to 

B. subtilis YulD protein 
LM-1214.1 From 2935083 to 2935904 unknown, highly similar to 

rhamnulose-1-phosphate 
aldolase 

LM-1216.1 From 2935917 to 2937179 unknown, highly similar to 

L-rhamnose isomerase 
LM-1218.1 From 2937192 to 2938643 unknown; similar to 

rhamnulokinase 

LM-122.1 From 1237934 to 1239013 unknown, similar to endo- 

1,4-beta-glucanase and to 
aminopeptidase 

LM-1220.1 From 2938662 to 2939930 Unknown, similar to sugar 

transport proteins 

LM-1221.1 From 2940053 to 2941033 Unknown, similar to AraC- 

type regulatory protein 
LM-1222.1 From 2941083 to 2941544 unknown 
LM-1223.2 From 2941599 to 2942219 unknown, highly similar to 

B. subtilis Jag protein 
LM-1224.3 From 2942216 to 2943079 Unknown, highly similar to 

B. subtilis SpolllJ protein 
LM-1225.2 From 884170 to 885270 Unknown, similar to 

excinuclease ABC, chain C 
(UvrC) 

LM-1228.1 From 882962 to 884065 Unknown, similar to B. 

subtilis YxjH and YxjG 
proteins 



WO 01/77335 



PCT/FR01/01118 



58 



SeqID n° 240 LM-123.1 From 1236836 to 1237822 

SeqID n° 241 LM-1230.1 From 882328 to 882705 

SeqID n° 242 LM-1231 .1 From 881992 to 882249 

SeqID n° 243 LM-1239.1 From 872616 to 875258 



SeqID n°244 LM-1241.1 From 871940 to 872410 
SeqID n° 245 LM-1 242.1 From 870587 to 871807 



SeqID n° 246 LM-1243.1 From 869095 to 870480 

SeqID n° 247 LM-1 245.1 From 867163 to 868800 

SeqID n° 248 LM-1246.1 From 866577 to 866990 

SeqID n° 249 LM-1 249.1 From 864791 to 865537 

SeqID n° 250 LM-125.1 From 1235670 to 1236539 

SeqID n° 251 LM-1250.1 From 863798 to 864688 

SeqID n° 252 LM-1251.1 From 863322 to 863606 

SeqID n° 253 LM-1 254.1 From 861731 to 862642 

SeqID n° 254 LM-1255.1 From 859656 to 861617 

SeqID n° 255 LM-1 257.1 From 855759 to 859406 

SeqID n° 256 LM-1 258.1 From 854745 to 855581 

SeqID n° 257 LM-1259.1 From 854443 to 854748 

SeqID n° 258 LM-1261.1 From 852704 to 854338 

SeqID n° 259 LM-1 262.1 From 851225 to 852505 

SeqID n° 260 LM-1263.1 From 850216 to 851184 

SeqID n° 261 LM-1 265.1 From 849314 to 850138 

SeqID n° 262 LM-1 266.1 From 848804 to 849193 

SeqID n° 263 LM-1267.1 From 848126 to 848788 



unknown, similar to N- 
acetylmuramoyl-L-alanine 
amidase (autolysin) 
unknown, conserved 
hypothetical protein 
unknown, similar to B. 
subtilis protein YsdA 
unknown, similar to cation 
(clacium) transporting 
ATPase 
Unknown 
unknown, similar to 
Tetracycline resistance 
protein 
unknown, highly similar to 
hexose phosphate 
transport protein 
Unknown, ABC transporter 
(ATP binding protein) 
unknown, similar to B. 
subtilis YrkR protein 

unknown 
unknown, similar to N- 
acetylmuramoyl-L-alanine 
amidase (autolysin) 
Unknown; similar to 
transcriptional regulator 
unknown, similar to 
transposase 
unknwon 
unknown, highly similar to 
fructose- 1,6- 
bisphosphatase 
unknown, highly similar to 
pyruvate-flavodoxin 

oxidoreductase 
unknown, similar to 

transposases 
unknown, similar to 

transposases 
Unknown, similar to 
transport protein 
unknown, similar to 3- 
hydroxy-3-methylglutaryl- 
coenzyme a reductase 
unknown 
unknown, similar to 
oxydoreductases 
unknown, similar to 
transcriptional regulators 
unknown 
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SeqID n° 264 


LM-1268.1 


From 847552 


to 848109 


SeqID n° 265 
SeqID n° 266 


LM-1269.1 
LM-1271.2 


From 846594 
From 843949 


to 847442 
to 846579 


SeqID n° 267 


LM-1 273.1 


From 939106 


to 939819 


oeqiu n zoo 


Llvl- 1 Z / O.Z 


rrom yo/ / oy 


to yoyuzo 


SeqID n° 269 
SeqID n° 270 


LM-1 277.2 
LM-1 278.1 


From 937202 
From 936136 


to 937594 
to 936603 


SeqID n° 271 


LM-1 28.1 


From 1233904 


to 1234434 


ocij i lj n z/z 


Llvl- IZOO. 1 


rrom yooyoy 


to yoo IOO 


oecjiLj n z / o 


Llvl- I ZOO. I 


rrom yozzo/ 


to yooooz 


QonI n n° 97/1 


1 M i OAR i 
Llvl- IZOO. 1 


rrom yo i^fo i 


to yozuou 


oecjiLJ n z / o 


1 M A OSX7 1 
Llvl- I ZO/ . I 


rrom youo/ i 


to yo i^fovj 


SeqID n° 276 


LM-1 288.1 


From 930220 


to 930693 


SeqID n° 277 


LM-1 289.1 


From 929892 


to 930236 


SeqID n° 278 


LM-1 29.1 


From 1233366 


to 1233815 


SeqID n° 279 


LM-1290.1 


From 928738 


to 929742 


SeqID n° 280 


LM-1 291.1 


From 928311 


to 928721 




Livi- izyo. i 


rrom yz/ yoz 


CO oZOOUO 


SeqID n° 282 


LM-1294.1 


From 927110 


to 927946 


SeqID n° 283 


LM-1295.1 


From 926511 


to 926858 


SeqID n° 284 


LM-1296.1 


From 926229 


to 926507 


SeqID n° 285 


LM-1 297.1 


From 924917 


to 926023 


SeqID n° 286 


LM-1298.1 


From 924542 


to 924898 



unknown, some similarity 
to acatyltransferases 
unknown 
unknown, similar to cation 
transporting ATPase 
Unknown, similar to 
transcription regulator 
(GntR family) 
unknown, similar to PTS 
system, cellobiose-specific 
IIC component 
Unknown 
unknown, similar to B. 
subtilis YdcK protein 
unknown, similar to 
unknown proteins 
unknown, conserved 
hypothetical protein 
unknown, similar to 
transport proteins 
Indirect negative regulation 
of sigma B dependant gene 
expression (serine 
phosphatase) 
RNA polymerase sigma-37 

factor (sigma-B) 
sigma- B activity negative 
regulator RsbW 
anti-anti-sigma factor 
(antagonist of RsbW) 
unknown, similar to 
unknown proteins 
unknown, highly similar to 
serine phosphatase RsbU 
unknown, highly similar to 
positive regulation of 
sigma-B activity 
unknown, highly similar to 
negative regulation of 
sigma-B activity 
unknown, highly similar to 
positive regulator of sigma- 
B activity 
Unknown, similar to B. 
subtilis YdcE protein 
Unknown, similar to B. 
subtilis YdcD protein 
Unknown, similar to alanine 

racemase 
Unknown, similar to holo- 
acyl-carrier protein 
synthase 
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SeqID n° 287 LM-13.1 From 2719735 to 2720484 



SeqID n° 288 


LM-130.1 


From 1232884 


to 1233351 


OCLj 1 1 / 11 


l—IVI" 1 0\J\J . 1 


Prom 




SeqID n° 290 


LM- 1302.1 


From 921537 


to 923012 


SealD n° 291 


LM-1303 1 

1 — IVI 1 JUJ. 1 


I lUIII <U /L. I \J\J*L 




SeqID n° 292 
SeqID n° 293 
SeqID n° 294 


LM- 1304.1 
LM-1 306.1 
LM-1 307.1 


From 920452 
From 918135 
From 917164 


to 920928 
to 918902 
to 918117 


SeqID n° 295 


LM-1309.1 


From 916436 


to 917164 


SeqID n° 296 


LM-1 31.1 


From 1232272 


to 1232838 


SeqID n° 297 


LM-1 31 2.1 


From 915090 


to 916415 


SeqID n° 298 


LM-1 31 3.1 


From 914735 


to 915070 


SeqID n° 299 


LM-1 31 4.1 


From 914405 


to 914734 


SeqID n° 300 


LM-1 31 6.1 


From 912372 


to 914390 


SeqID n° 301 


LM-1 31 7.1 


From 910998 


to 912164 


SeqID n° 302 


LM-1318.1 


From 910517 


to 910870 


SeqID n° 303 
SeqID n° 304 
SeqID n° 305 
SeqID n° 306 
SeqID n° 307 


LM-1319.1 
LM-1 320.1 
LM-1321.1 
LM-1 322.1 
LM-1 325.1 


From 910173 
From 909159 
From 908499 
From 907979 
From 905962 


to 910484 
to 910157 
to 909056 
to 908467 
to 907524 


SeqID n° 308 


LM-1 326.1 


From 903837 


to 905510 


SeqID n° 309 


LM-1 327.1 


From 902773 


to 903840 



SeqID n° 310 
SeqID n° 311 



LM-1 328.1 From 901837 to 902751 
LM-1 329. 1 From 900303 to 901 835 



Unknown, similar to 
creatinine amidohydrolase 
unknown, similar to 
unknown proteins 
unknown, similar to 
protoporphyrinogen IX and 
coproporphyrinogen III 

oxidase (HemY) 
Unknown, similar to B. 
subtilis YbtB protein 
unknown, similar to B. 
subtilis YdbS protein 
unknown 
unknown 
unknown, similar to 
oxidoreductases 
unknown, similar to B. 
subtilis NagB protein 
(glucosamine-6-phosphate 
isomerase) 
unknown, similar to 
unknown protein 
unknown, similar to PTS 
system, Lichenan-specific 
enzyme (iC component 
unknown, similar to PTS 
system, beta-glucoside 
enzyme I IB component 
Unknown, similar to PTS 
system enzyme IIA 

component 
Unknown; Similar to 
transcriptional regulator 
(antiterminator) 
Unknown; similar to 
antibiotic resistance protein 

Unknown, similar to B. 
subtilis YtcD protein 
unknown 
unknown 
unknown 
unknown 
Unknown, similar to ATP- 
dependent RNA helicase 

Unknown, similar to 
phosphomannomutase 
unknown 
unknown 
unknown, similar to oligo- 
1,6-glucosidase 
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SeqID n° 312 LM-1331.1 From 899454 to 900287 

SeqID n° 313 LM-1 332.1 From 898589 to 899467 

SeqiD n° 314 LM-1 333.3 From 897273 to 898592 

SeqID n° 315 LM-1335.3 From 1104048 to 1104851 

SeqID n° 316 LM-1337.2 From 1102858 to 1103757 

SeqID n° 317 LM-1 338.1 From 1099266 to 1102706 

SeqID n° 318 LM-1 339.1 From 1097786 to 1098988 

SeqID n° 319 LM-1341.1 From 1097322 to 1097603 

SeqID n° 320 LM-1342.1 From 1096874 to 10971 16 

SeqID n° 321 LM-1343.2 From 1095975 to 1096835 

SeqID n° 322 LM-1349.1 From 1093965 to 1095803 

SeqID n° 323 LM-1351.1 From 1093000 to 1093773 

SeqID n° 324 LM-1 352.1 From 1092255 to 1092869 

SeqID n° 325 LM-1 353.1 From 1091110 to 1092054 

SeqID n° 326 LM-1354.1 From 1090375 to 1091043 

SeqID n° 327 LM-1 357.1 From 1088941 to 1090362 

SeqID n° 328 LM-1358.1 From 1087409 to 1088854 

SeqID n° 329 LM-1 359.1 From 1086772 to 1087434 

SeqID n° 330 LM-1360.1 From 1086100 to 1086630 



unknown, similar to sugar 
ABC transporter, permease 

protein - 
unknown, similar to sugar 
ABC transporter, permease 
protein 
Unknown, similar to 
putative sugar ABC 
transporter, periplasmic 
sugar-binding protein, 
unknown, highly similar to 
teichoic acid translocation 
permease protein TagG 
unknown, similar to metal 

binding protein 
unknown, highly similar to 

pyruvate carboxylase 
unknown, similar to cell- 
division protein RodA and 
FtsW 

unknown, similar to B. 
subtilis YlaN protein 
unknown, similar to B. 
subtilis Ylal protein 
unknown 
unknown, similar to GTP- 
binding ptotein TypA/BipA 
(tyrosine phosphorylated 
protein A) from E. coli and 
B. subtilis (YlaG) 
unknown, similar to 
extragenic suppressor 
protein SuhB and to myo- 
inositol-! (or 4)- 
monophosphatase 
unknown, similar to B. 
subtilis YktB protein 
unknown, similar to 
membrane and transport 

proteins 
unknown, similar to ABC 
transporter (permease) 
unknown, conserved 
hypothetical protein 
unknown, similar to sensor 
protein histidine kinases (2 
components regulatory 
systems) 

unknown, similar to 
transcription response 
regulator 
unknown 
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SeqID n° 331 

SeqlD n° 332 

SeqID n° 333 
SeqID n° 334 



LM-1361.1 From 1085806 to 1086078 

LM-1 363.1 From 1084853 to 1085806 

LM-1364.1 From 1084413 to 1084724 

LM-1367.1 From 1082822 to 1084225 



SeqID n° 335 LM-1368.1 From 1081183 to 1082817 



SeqID n° 336 
SeqID n° 337 
SeqID n° 338 

SeqID n° 339 
SeqID n° 340 

SeqJD n° 341 

SeqID n° 342 

SeqID n° 343 

SeqID n 6 344 

SeqID n° 345 

SeqID n° 346 



LM-1369.1 From 1080095 to 1081072 

LM-1370.1 From 1078977 to 1080092 

LM-1 371.1 From 1077592 to 1078143 

LM-1 372.1 From 1 076989 to 1 077543 

LM-1 373.1 From 1075860 to 1076858 

LM-1 374.1 From 1075360 to 1075848 

LM-1 376.1 From 1074324 to 1075325 

LM-1377.1 From 1073813 to 1074295 

LM-1 379. 1 From 1 073552 to 1 073800 

LM-1 380.1 From 1073146 to 1073568 

LM-1381.1 From 1072664 to 1073149 



SeqID n° 347 LM-1383.1 From 1071462 to 1072685 



unknown, similar to B. 
subtilis YktA protein 
unknown, similar to L- 
lactate dehydrogenase 
unknown 
unknown, highly similar to 

dihydrolipoamide 
dehydrogenase, E3 subunit 
of pyruvate dehydrogenase 

complex 
unknown, highly similar to 
pyruvate dehydrogenase 
(dihydrolipoamide 
acetyltransferase E2 
subunit) 
unknown, highly similar to 
pyruvate dehydrogenase 

(E1 beta subunit) 
unknown, highly similar to 
pyruvate dehydrogenase 
(E1 alpha subunit) 
unknown, similar to 
formylmethionine 
deformylase and to B. 
subtilis YkrB protein 
unknown, similar to B. 
subtilis YdfE protein 
Unknown, similar to 
molybdopterin biosynthesis 
protein MoeB 
unknown, similar to 
molybdenum cofactor 
biosynthesis protein B 

unknown, similar to 
molybdenum cofactor 
biosynthesis protein A 
unknown, similar to 
molybdenum cofactor 
biosynthesis protein C 
unknown, similar to 
molybdopterin converting 
factor (subunit 1). 
Unknown, similar to 
molybdopterin converting 
factor, subunit 2 
unknown, similar to 
molybdopterin-guanine 
dinucleotide biosynthesis 
MobB 
unknown, similar to 
molybdopterin biosynthesis 
protein moeA 
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SeqlDn°348 LM-1385.1 From 1070597 to 1071367 

SeqlDn°349 LM-1386.1 From 1069821 to 1070492 

SeqlDn°350 LM-1387.1 From 1069156 to 1069818 

SeqlDn°351 LM-1388.1 From 1068594 to 1069175 

SeqlDn°352 LM-1389.1 From 1067723 to 1068526 



SeqID n° 353 LM-1390.1 From 1066397 
SeqID n° 354 LM-1391.1 From 1064521 



SeqID n° 355 

SeqID n° 356 

SeqID n° 357 

SeqID n° 358 

SeqID n° 359 
SeqID n° 360 



LM-1 392.1 From 1063061 

LM-1 394.1 From 1062108 

LM-1396.1 From 1061291 

LM-1 398.3 From 1059872 

LM-1401.1 From 235524 

LM-1402.1 From 234414 



to 1067662 
to 1066377 



to 1064524 

to 1063064 

to 1062115 

to 1061275 

to 237020 
to 235409 



SeqID n° 361 LM-1404.1 From 233858 to 234337 



SeqID n° 


362 


LM-1 405.1 


From 233491 


to 233865 


SeqID n° 


363 


LM-1 407.1 


From 232662 


to 233477 


SeqID n° 


364 


LM-1408.1 


From 230840 


to 231766 


SeqID n° 


365 


LM-1410.1 


From 229840 


to 230724 


SeqID n° 


366 


LM-1411.1 


From 229045 


to 229824 


SeqID n° 367 


LM-1413.1 


From 226854 


to 228929 


SeqID n° 


368 


LM-1415.1 


From 224562 


to 226508 



unknown, similar to 
molybdate binding protein 
Unknown, similar to 
molybdenum transport 

protein 
unknown, similar to ABC 
transporter 
unknown, weakly similar to 

molybdopterin-guanine 
dinucleotide biosynthesis 

protein A 
unknown, highly similar to 
B. subtilis YoaT protein 
unknown 
unknown, similar to 
phosphotransferase 
system (PTS) beta- 
glucoside-specific enzyme 
IIABC 
unknown, similar to 

glycerol kinase 
unknown, similar to 

transketolase 
unknown, similar to 

transketolase 
unknown, similar to 
hypothetical proteins 
lysyl-tRNA synthetase 
Unknwon, conserved 
hypothetical protein 
unknown, similar to 7,8- 
dihydro-6- 
hydroxymethylpterin 
pyrophosphokinase 
unknown, highly similar to 
dihydroneopterin aldolase 
unknown, highly similar to 
dihydropteroate synthases 
unknown, highly similar to 
cysteine synthase 
Unknwon, conserved 
hypothetical protein 
Unknwon, conserved 
hypothetical protein 
unknown, highly similar to 
cell division protein ftsH 
Unknown, fusion protein, 
N-terminal part similar to B. 
subtilis YacA protein, O 
terminal part similar to 
hypoxanthine-guanine 
phosphoribosyltransferase 
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SeqlDn°369 LM-1418.1 From 224027 to 224455 

SeqlDn°370 LM-1420.1 From 223490 to 223876 

SeqlDn°371 LM-1426.1 From 221370 to 222959 

SeqID n° 372 LM-1429.1 From 217800 to 221339 

SeqID n° 373 LM-1430.1 From 217125 to 217685 

SeqlDn°374 LM-1432.1 From 216434 to 217018 

SeqlDn°375 LM-1433.2 From 215721 to 216344 

SeqlDn°376 LM-1434.2 From 214486 to 215427 



SeqID n° 377 
SeqID n° 378 



LM-1435.1 
LM-1436.1 



From 213735 
From 213336 



SeqID n° 380 
SeqID n° 381 
SeqID n° 382 



LM-1 438.1 
LM-1439.1 
LM-1 442.1 



From 212366 
From 211425 
From 209470 



SeqID n° 384 
SeqID n° 385 

SeqID n° 386 

SeqID n° 387 

SeqID n° 388 



LM-1445.1 
LM-1 446.1 

LM-1447.1 

LM-1 448.1 

LM-1449.3 



From 205819 
From 204624 

From 203640 

From 202641 

From 201217 



to 214409 
to 213668 



SeqID n° 379 LM-1437.1 From 212824 to 213285 



to 212689 
to 212294 
to 21 1389 



SeqID n° 383 LM-1444.1 From 207739 to 209271 



SeqID n° 389 LM-1451.1 From 1891236 

SeqID n° 390 LM-1453.1 From 1891944 

SeqID n° 391 LM-1454.1 From 1892603 

SeqID n° 392 LM-1457.1 From 1893497 



to 207408 
to 205577 

to 204353 

to 203597 

to 202590 

to 1891880 
to 1892600 

to 1893478 
to 1895464 



SeqID n° 393 LM-1458.1 From 1895461 to 1896219 



Unknown, 
polyribonucleotide 
nucleotidyltransferase 

domain present 
unknown, similar to B. 
subtilis DivIC protein 
Unknown, conserved 
membrane-spanning 

protein 
transcription-repair 

coupling factor 
Unknwon, similar to 
peptidyl-tRNA hydrolase 

Unknown 
Unknown, similar to B. 
subtilis general stress 

protein 
Unknwon, similar to L- 
lactate dehydrogenase 

Unknown 
Unknown, conserved 
hypothetical protein 
Unknown, hypothetical 
lipoprotein 
Unknwon 
phospholipase C 
actin-assembly inducing 

protein precursor 
Zinc metalloproteinase 

precursor 
listeriolysin O precursor 

phosphatidylinositol- 
specific phospholipase c 
listeriolysin positive 
regulatory protein 
phosphoribosyl 
pyrophosphate synthetase 
unknown, highly similar to 
UDP-N-acetylglucosamine 

pyrophosphorylase 
Unknown, weakly similar to 
thiamin pyrophosphokinase 
Unknown, similar to 
ribulose-5-phosphate 3- 
epimerase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
putative serine/threonine- 
specific protein kinase 

Unknown, similar to 
putative phosphoprotein 
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SeqID n° 394 LM-1459.1 From 1896242 to 1897576 

SeqID n° 395 LM-1461.1 From 1897577 to 1898515 

SeqID n° 396 LM-1463.1 From 1898529 to 1900922 

SeqiD n° 397 LM-1465.1 From 1900927 to 1902126 

SeqID n° 398 LM-1466.2 From 1902484 to 1903101 

SeqID n° 399 LM-1467.2 From 1903120 to 1903995 

SeqID n° 400 LM-1 468.1 From 1904152 to 1905864 

SeqID n° 401 LM-1469.1 From 1905952 to 1906551 

SeqID n° 402 LM-1 470.1 From 1906592 to 1907221 

SeqID n° 403 LM-1471.1 From 1907218 to 1907919 

SeqID n° 404 LM-1472.1 From 1907916 to 1908830 

SeqID n° 405 LM-1473.1 From 1908827 to 1909591 

SeqID n° 406 LM-1474.1 From 1909614 to 1912826 

SeqID n° 407 LM-1 476.1 From 1912819 to 1913910 

SeqID n° 408 LM-1 477.1 From 1913907 to 1915187 

SeqID n° 409 LM-1478.1 From 1915175 to 1916086 

SeqID n° 410 LM-1480.1 From 1916166 to 1917452 



phosphatase 

Unknown, similar to RNA- 
binding Sun protein 
Unknown, similar to 
methionyl-tRNA 
formyltransferase 
unknown, similar to 
primosomal replication 
factor Y 
unknown, similar to 
pantothenate metabolism 
flavoprotein homolog 
unknown, similar to 
guanylate kinases 
unknown, simlar to 
conserved hypothetical 
protein 
unknown, similar to 
fibronectin binding proteins 
unknown, similar to 
conserved hypotheticl 
proteins 
unknown, highly similar to 

orotate 
phosphoribosyltransferases 
unknown, highly similar to 
orotidine 5 -phosphate 
decarboxylases 
unknown, highly similar to 
dihydroorotase 
dehydrogenase 
unknown, highly similar to 

dihydroorotate 
dehydrogenase (electron 

transfer subunit) 
unknown, highly similar to 
carbamoyl-phosphate 
synthetase (catalytic 
subunit) 
unknown, highly similar to 

carbamoyl-phosphate 
synthetase (glutaminase 
subunit) 

unknown, highly similar to 

dihydroorotase 
unknown, highly similar to 
aspartate 
carbamoyltransferase 
unknown, highly similar to 
uracil permease 
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SeqID n° 411 LM-1481.2 From 1917581 to 1918132 



SeqID n° 412 LM-1482.2 From 1918363 to 1918593 
SeqID n° 413 LM-1483.2 From 645397 to 645858 



SeqID n° 414 LM-1484.1 From 645878 to 647590 

SeqID n° 415 LM-1486.1 From 647587 to 649404 

SeqID n° 416 LM-1487.1 From 649520 to 649819 

SeqID n° 417 LM-1491.1 From 651794 to 652420 

SeqID n° 418 LM- 1492.1 From 652589 to 653044 

SeqID n° 419 LM-1493.1 From 653041 to 653982 

SeqID n° 420 LM-1494.1 From 654078 to 65461 1 

SeqID n° 421 LM-1495.1 From 654608 to 654850 

SeqID n° 422 LM-1498.1 From 654957 to 656708 



SeqID n° 423 LM-1 499.1 From 656749 to 657243 
SeqID n° 424 LM-15.1 From 2720497 to 2721489 

SeqID n° 425 LM-1500.1 From 657323 to 658465 



SeqID n° 426 
SeqID n° 427 
SeqID n° 428 

SeqID n°429 
SeqID n° 430 
SeqID n° 431 

SeqID n° 432 
SeqID n°433 

SeqID n° 434 
SeqID n° 435 



LM-1501 .1 From 658572 to 659027 

LM-1 502.1 From 659083 to 659475 

LM-1 504.1 From 659572 to 660312 

LM-1 506. 1 From 660398 to 660676 

LM-1 507. 1 From 660692 to 660958 

LM-1 508.1 From 660996 to 661439 

LM-1 51 0.3 From 661470 to 662171 

LM-1 511.3 From 662257 to 663936 

LM-1 516.1 From 669423 to 669950 

LM-1518.1 From 670309 to 672339 



unknown, highly similar to 
pyrimidine operon 
regulatory protein 

unknown 
Unknown, similar to 
transcription regulator 
MarR family 
Unknown, similar to ABC 
transporter, ATP-binding 

protein 
Unknown, similar to ABC 
transporter, ATP-binding 

protein 
Unknown, similar to E. coli 

phage shock protein E 
Unknown, similar to acyl- 

carrier protein 
phosphodiesterase and 
NAD(P)H dehydrogenase 
Unknown, similar to 
transcription regulator 
MarR family 
Unknown, similar to 

oxidoreductase 
unknown, conserved 
hypothetical protein 

Unknown 
Unknown, C-terminal 
domain similar to 
glycerophosphoryl diester 
phosphodiesterase 

Unknown 
Unknown, similar to 
Phosphotriesterase 
Unknown, similar to protein 
kinase 
Unknown 
Unknown 
Unknown, conserved 
hypothetical protein 
Unknown, hypothetical 
Unknown 
Unknown, similar to 
unknown proteins 

Unknown 
Unknown, similar to 
unknown protein 
Unknown 
Unknown, similar to 
transcription antiterminator 
BgIG family 
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SeqID n° 436 LM-1519.1 From 672341 to 672793 



SeqID n° 437 LM-1 520.1 From 672794 to 673855 



SeqID n° 438 LM-1521.1 From 673870 to 674178 



SeqID n° 439 LM-1523.1 From 674205 to 675473 



SeqID n° 440 LM-1 524.1 From 675563 to 676267 

SeqID n° 441 LM-1525.1 From 676372 to 676788 

SeqID n° 442 LM-1 527.1 From 676802 to 677395 

SeqID n° 443 LM-1528.1 From 678494 to 679123 

SeqID n° 444 LM-1 529.1 From 679763 to 680296 



SeqID n° 445 LM-1 53.1 From 1216820 to 1218178 

SeqID n° 446 LM-1530.1 From 680360 to 681277 

SeqID n° 447 LM-1 532.1 From 681543 to 683423 

SeqID n° 448 LM-1 533.1 From 683519 to 684247 

SeqID n° 449 LM-1 535.1 From 684390 to 685040 

SeqID n° 450 LM-1536.3 From 685083 to 686903 

SeqID n° 451 LM-1 537.2 From 468169 to 469437 



SeqID n° 452 
SeqID n° 453 
SeqID n° 454 



LM-1 538.1 
LM-1 54.1 
LM-1 540.1 



From 467519 
From 1215993 
From 466517 



SeqID n° 455 LM-1 541.1 From 465934 
SeqID n° 456 LM-1545.1 From 459681 



SeqID n° 457 
SeqID n° 458 
SeqID n° 459 



LM-1 547.1 
LM-1 549.1 
LM-1 55.1 



i 

From 457021 
From 454534 
From 1215128 



to 468136 
to 1216487 
to 467362 

to 466419 

to 465722 



to 458913 
to 456936 
to 1215679 



Unknown, similar to PTS 
system, fructose-specific 

HA component 
Unknown, similar to PTS 
system, fructose-specific 

IIC component 
Unknown, similar to PTS 
system, fructose-specific 

IIB component 
Unknown, similar to an E. 
coii putative tagatose 6- 
phosphate kinase 

Unknown 
Unknown, similar to 
unknown proteins 
Unknown, weakly similar to 
methyltransferase 
Unknown 
Unknown, similar to a 
transcription regulator 
(surface protein PAg 
negative regulator par) 
Unknown, similar to 
cobyrinic acid a,c-diamide 
synthase 
Unknown, similar to 
oxidoreductase 
Unknown, similar to heavy 
metal-transporting ATPase 
Unknown 
Unknown, similar to 
putative transaldolase 
Unknown, similar to 
conserved hypothetical 
proteins 
Unknown, weakly similar to 
a module of peptide 
synthetase 
unknown 
unknown 
Unknown, conserved 
hypothetical protein 
Unknown, similar to 
unknown proteins 
Unknown, putative 
peptidoglycan linked 
protein (LPXTG motif) 
Internalin B 
Internalin A 
Unknown, similar to 
transcriptional regulator 
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SeqID n° 460 LM-1551.1 From 453107 to 453853 

SeqID n° 461 LM-1 553.1 From 452524 to 453093 

SeqID n° 462 LM-1 554.1 From 451531 to 452406 

SeqID n° 463 LM- 1557.1 From 448926 to 451508 

SeqID n° 464 LM-1 558.1 From 447804 to 448910 



SeqID n° 465 LM-1 56.1 From 1213636 to 1215087 
SeqID n° 466 LM-1 560.1 From 447471 to 447791 



SeqID n° 467 LM-1 561.1 From 447010 to 447474 

SeqID n° 468 LM-1564.1 From 445049 to 447010 

SeqID n° 469 LM-1 566.1 From 444031 to 444888 

SeqID n° 470 LM-1 568.1 From 443303 to 443851 

SeqID n° 471 LM-1 569.1 From 442869 to 443300 

SeqID n° 472 LM-157.1 From 1212983 to 1213426 

SeqID n° 473 LM-1570.1 From 441622 to 442872 



SeqID n° 474 

SeqID n° 475 

SeqID n° 476 
SeqID n° 477 
SeqID n° 478 

SeqID n° 479 

SeqID n° 480 

SeqID n°481 
SeqID n° 482 



LM-1 571.1 

LM-1 572.1 

LM-1 573.1 
LM-1 574.1 
LM-1 575.2 

LM-1 577.4 

LM-1 578.2 

LM-1 579.1 
LM-1 58.1 



From 440758 

From 440056 

From 439677 
From 439214 
From 437482 

From 436379 

From 977051 

From 976065 
From 1211869 



to 441570 

to 440610 

to 439976 
to 439624 
to 438882 

to 437188 

to 978856 

to 977039 
to 1212990 



Unknown, similar to 

oxidoreductase 
Unknown, similar to 
acetyltransferase 
Unknown, similar to 
transcriptional regulator 
(LysR family) 
Unknown, similar to sugar 

hydrolase 
Unknown, similar to PTS 
fructose-specific enzyme 
IIC component 
unknown 
Unknown, similar to PTS 
fructose-specific enzyme 

MB component 
Unknown, similar to PTS 
fructose-specific enzyme 
IIA component 
Unknown, similar to 
transcription antiterminator 
BgIG family 
Unknown, similar to 
Staphylococcus xylosus 
glucose uptake protein 
Unknown, similar to RNA 
polymerase ECF-type 

sigma factor 
Unknown, similar to 
unknown protein 
similar to ethanoJamine 
utilization protein EutQ 
Unknown, similar to rod 
shape-determining protein 
RodA 
Unknown, conserved 
hypothetical protein 
Unknown, similar to 
unknown protein 
Unknown 
Unknown 
Unknown, similar to endo- 
1 ,4-beta-xylanase 
Unknown, conserved 
membrane protein 
unknown, similar to heat 
shock protein HtpG 

Unknown 
Unknown, similar to 
ethanolamine utilization 
protein EutH - Escherichia 
coli 
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SeqlDn°483 LM-1 580.1 From 975071 to 976033 Unknown 
SeqID n° 484 LM-1 581 .1 From 974229 to 974975 Unknown 
SeqID n° 485 LM-1 582.1 From 973758 to 974216 unknown, similar to protein- 

tyrosine-phosphatase 
SeqID n° 486 LM-1 583.1 From 972719 to 973459 unknown, similar to 

Nitroflavin-reductase 

SeqID n° 487 LM-1 584.1 From 972197 to 972706 Unknown, similar to B. 

subtilis CspR protein, rRNA 
methylase homolog 

SeqID n° 488 LM-1 585.1 From 971037 to 972176 unknown, similar to B. 

subtilis YhbA protein 

SeqID n° 489 LM-1 586.1 From 969549 to 970496 unknown, similar to B. 

subtilis YkcC protein 

SeqID n° 490 LM-1 587.1 From 968819 to 969424 unknown, conserved 

hypothetical protein 

SeqID n° 491 LM-1589.1 From 967784 to 968779 unknown, similar to lipoate 

protein ligase A 

SeqID n° 492 LM-1 59.1 From 1211308 to 1211853 Unknown, similar to 

Salmonella enterica PduT 
protein 

SeqID n° 493 LM-1 590.1 From 967029 to 967760 unknown, conserved 

hypothetical protein, similar 
to B. subtilis Yhfl protein 

SeqID n° 494 LM-1 592.1 From 966245 to 966913 unknown, similar to sortase 
SeqID n° 495 LM-1 593.1 From 965513 to 966136 unknown, similar to 3- 

methyladenine DNA 
glycosylase 

SeqID n° 496 LM-1 594.1 From 963294 to 965255 unknown, hypothetical 

transmembrane protein 
SeqID n° 497 LM-1 596.1 From 962607 to 963263 unknown, similar to 

transcription regulator 
(TetR/AcrR family) 

SeqID n° 498 LM-1 597.1 From 961469 to 962584 unknown, putative 

membrane protein 

SeqID n° 499 LM-16.1 From 2721559 to 2722857 Unknown, similar to 

membrane proteins 

SeqID n° 500 LM-160.1 From 1211046 to 1211315 Unknown, similar to carbon 

dioxide concentrating 
mechanism protein 

SeqID n° 501 LM-1600.1 From 960748 to 961221 Unknown, similar to ABC 

transporter, ATP-binding 
protein (N-terminal part) 

SeqID n° 502 LM-1 602.1 From 95971 1 to 960631 unknown, similar to 

pantothenate kinase 

SeqID n° 503 LM-1 603.1 From 958783 to 959622 unknown, similar to B. 

subtilis YcgQ protein 

SeqID n° 504 LM-1605.1 From 957724 to 958764 unknown, similar to B. 

subtilis YcgR protein 
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SeqID n c 


' 505 


LM-1606.1 


From 956031 


to 957602 


unknown, similar to ABC 
transporter ATP-binding 
protein (antibiotic 


SeqID n c 










resistance) 


' 506 


LM-1 607.1 


From 953842 


to 955740 


Unknown, similar to 
transcription antiterminator 


SeqID n c 










BgIG family 


' 507 


LM-1 608.1 


From 952318 


to 953769 


Unknown, similar to beta- 
glucosidase 


SeqID n c 


508 


LM-1 609.1 


From 951 956 


to 952306 


Unknown, similar to 
phosphotransferase 
system enzyme IIA 


SeqID n° 


509 


LM-161.1 


From 1210459 


to 1211031 


unknown 


SeqID n° 


510 


LM-1 61 0.1 


From 950673 


to 951941 


Unknown, similar to 
phosphotransferase 


SeqID n° 










system enzyme IIC 


511 


LM-1611.1 


From 950363 


to 950668 


Unknown, similar to PTS 


SeqID n° 










system, MB component 


512 


LM-1613.1 


From 948754 


to 950220 


Unknown, similar to 
succinate semialdehyde 


SeqID n° 










dehydrogenase 


513 


LM-1614.1 


From 947718 


to 948530 


unknown, similar to 
transporters (formate) 


SeqID n° 


514 


LM-1615.1 


From 947101 


to 947604 


Unknown 


SeqID n° 


515 


LM-1617.1 


From 945915 


to 947018 


Unknown 


SeqID n° 516 


LM-1618.1 


From 945520 


to 945912 


Unknown, similar to 












transcription regulator, 












GntR family 


SeqID n° 


517 


LM-1619.1 


From 944243 


to 945379 


unknown, similar to 
membrane proteins 


SeqID n° 


518 


LM-162.1 


From 1209821 


to 1210462 


Unknown, similar to 
Salmonella enterica PduL 
protein 


SeqID n° 


519 


LM-1620.1 


From 943526 


to 944200 


unknown, similar to 


SeqID n° 










phosphoglycerate mutase 


520 


LM-1 622.1 


From 942148 


to 943497 


Unknown, similar to 
glutathione Reductase 


SeqID n° 


521 


LM-1623.1 


From 941615 


to 942112 


unknown 


SeqID n° 


522 


LM-1 624,2 


From 940810 


to 941484 


Unknown 


SeqID n° 


523 


LM-1 625.2 


From 469575 


to 470078 


Unknown 


SeqID n° 


524 


LM-1 626.1 


From 470122 


to 472158 


Unknown, similar to 
penicillin-binding protein 
(D-alanyl-D-alanine 
carboxypeptidase) 


SeqID n° 


525 


LM-1 627.1 


From 472330 


to 472644 


Unknown 


SeqID n° 


526 


LM-1 629.1 


From 472781 


to 473710 


Unknown, similar to B. 
subtilis transcription 


SeqID n° 










regulator LytR 


527 


LM-1 63.1 


From 1209053 


to 1209808 


unknown, similar to 
cobalamin adenosyl 
transferase 
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SeqID n c 


'528 


LM-1631.1 


From 473936 


to 476716 


Unknown, conserved 


SeqID n c 










hypothetical protein 


529 


LM-1 632.1 


From 476960 


to 478447 


Unknown, similar to 
transcription regulator 


SeqID n c 


' 530 


LM-1 634.1 


From 478721 


to 479710 


Unknown, similar to 
penicillin acylase and to 
conjugated bile acid 
hydrolase 


SeqID n° 


531 


LM-1 635.1 


From 479765 


to 481 153 


Unknown, similar to 


SeqID n° 










glutamate decarboxylase 


532 


LM-1636.1 


From 481250 


to 482701 


Unknown, similar to amino 
acid antiporter 


SeqID n° 


533 


LM-1 637.1 


From 483166 


to 483891 


Unknown 


SeqID n° 


534 


LM-1638.1 


From 483934 


to 484599 


Unknown, similar to 
unknown proteins 


SeqID n° 


535 


LM-1 639.1 


From 484620 


to 485375 


Unknown 


SeqID n° 


536 


LM-1 64.1 


From 1208603 


to 1208887 


unknown, similar to 
putative carboxysome 
structural protein 


SeqID n° 


537 


LM-1 644.1 


From 485493 


to 487652 


Unknown, similar to 
unknown proteins 


SeqID n° 538 


LM-1 645.1 


From 487649 


to 488797 


Unknown, conserved 


SeqID n° 










hypothetical proteins 


539 


LM-1 646.1 


From 488802 


to 489749 


Unknown, conserved 
hypothetical protein similar 
to B. subtilis YeaC 


SeqID n° 


540 


LM-1647.1 


From 489905 


to 491500 


Unknown, similar to 
unknown proteins 


SeqID n° 


541 


LM-1 650.1 


From 491634 


to 492917 


Unknown, similar to 


SeqID n° 










permeases 


542 


LM-1652.1 


From 492918 


to 494018 


Unknown, similar to 


SeqID n° 










unknown proteins 


543 


LM-1 653.1 


From 494011 


to 495561 


Unknown, similar to 
hydantoinase 


SeqID n° 


544 


LM-1 655.1 


From 496381 


to 497919 


Unknown, similar to 
transcription regulator (VirR 
from Streptococcus 
pyogenes) 


SeqID n° 


545 


LM-1 656.1 


From 498171 


to 500240 


Unknown, putative 
membrane associated 


SeqID n° 










lipoprotein 


546 


LM-1 658.1 


From 500306 


to 500779 


Unknown 


SeqID n" 


547 


LM-1659.3 


From 500799 


to 501284 


Unknown 


SeqID n° 


548 


LM-1 66.1 


From 1207111 


to 1208571 


unknown, similar to 
acetaldehyde 
dehydrogenase / alcohol 


SeqID n° 


549 








dehydrogenase 


LM-1 66 1.4 


From 2632907 


to 2633761 


Unknown, similar to 
fructose-1 ,6-bisphosphate 
aldolase 
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SeqlDn°550 LM-1662.2 From 2631303 to 2632586 



SeqID n° 551 LM-1 663.1 From 2630285 to 2631310 

SeqID n° 552 LM-1664.1 From 2629208 to 2630278 

SeqID n° 553 LM-1665.1 From 2627816 to 2629087 

SeqID n° 554 LM-1666.2 From 2626442 to 2627713 

SeqID n° 555 LM-1668.2 From 2625414 to 2626361 

SeqID n° 556 LM-1 669.1 From 2624980 to 2625417 

SeqID n° 557 LM-167.1 From 1206590 to 1207111 

SeqID n° 558 LM-1671.1 From 2623125 to 2624411 

SeqID n° 559 LM-1 674.1 From 2622067 to 2623122 

SeqID n° 560 LM-1 676.1 From 2621201 to 2622067 

SeqID n° 561 LM-1678.1 From 2620514 to 2621089 

SeqID n° 562 LM-1 679.1 From 2619415 to 2620491 

SeqID n° 563 LM-168.1 From 1205922 to 1206575 

SeqID n° 564 LM-1681.1 From 2618577 to 2619428 

SeqID n° 565 LM-1 682.1 From 2617239 to 2618276 

SeqID n° 566 LM-1 683.1 From 2616835 to 2617242 

SeqID n° 567 LM-1684.1 From 2615458 to 2616699 

SeqID n° 568 LM-1685.1 From 2614692 to 2615321 



Unknwon, weakly similar to 
human N- 
acetylglucosaminyl- 
phosphatidylinositol 
biosynthetic protein 
Unknown, similar to 
gaiactosyltransferase 
Unknown, conserved 
hypothetical protein 
unknown, highly similar to 
UDP-N-acetylglucosamine 
1-carboxyvinyltransferase 
unknown, highly similar to 
transcription terminator 
factor rho 
Unknwon, similar to 
glycosyl transferases 
wall teichoic acid 
glycosyiation protein GtcA 
Unknown, similar to 
putative carboxysome 
structural protein 
unknown, highly similar to 
homoserine 
dehydrogenase 
unknown, highly similar to 

threonine synthase 
unknown, highly similar to 
homoserine kinase 
Unknwon, similar to 
thymidine kinase 
unknown, highly similar to 
peptide chain release 

factor 1 
Unknown, similar to 
putative carboxysome 
structural protein (eutL) 
Unknown, similar to 
protoporphyrinogen 
oxidase 
Unknown, similar to yeast 
translation initiation protein 

Unknown, similar to 
phosphatases 
unknown, highly similar to 

glycine 
hydroxymethyltransferase 
unknown, highly similar to 
uracil 

phosphoribosyltransferase 
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SeqID n° 569 LM-1686.1 From 2613435 to 2614574 

SeqID n° 570 LM-1691.1 From 2612205 to 2612603 

SeqiD n° 571 LM-1693.1 From 2611482 to 2612198 

SeqID n° 572 LM-1694.1 From 2610592 to 2611104 

SeqID n° 573 LM-1695.1 From 2610056 to 2610595 

SeqID n° 574 LM-1697.1 From 2608515 to 2610029 

SeqID n° 575 LM-1698.1 From 2607605 to 2608477 

SeqID n° 576 LM-170.1 From 1205018 to 1205899 

SeqID n° 577 LM-1700.1 From 2606123 to 2607544 

SeqID n° 578 LM-1701.1 From 2605697 to 2606101 

SeqID n° 579 LM-1 702.1 From 2605337 to 2605573 

SeqID n° 580 LM-1 704.1 From 2603863 to 2605155 

SeqID n° 581 LM-1705.2 From 2602705 to 2603700 

SeqID n° 582 LM-1706.2 From 521863 to 522591 

SeqID n° 583 LM-1707.1 From 522628 to 523521 

SeqID n° 584 LM-1708.1 From 523719 to 525713 

SeqID n° 585 LM-171.1 From 1203634 to 1204998 

SeqID n° 586 LM-1710.1 From 525813 to 526688 

SeqID n° 587 LM-1712.1 From 526754 to 527512 



Unknwon, similar to UDP- 
N-acetylglucosamine 2- 
epimerase 
unknown, highly similar to 

ATP synthase subunit i 
unknown, highly similar to 
H+-transporting ATP 
synthase chain a 
unknown, highly similar to 
H+-transporting ATP 
synthase chain b 
unknown, highly similar to 
H+-transporting ATP 
synthase chain delta 
unknown, highly similar to 
H+-transporting ATP 
synthase chain alpha 
unknown, highly similar to 
H+-transporting ATP 
synthase chain gamma 

Unknown, similar to 
ethanolamine ammonia- 

lyase, light chain 
unknown, highly similar to 
H+-transporting ATP 
synthase chain beta 
unknown, highly similar to 
H+-transporting ATP 
synthase chain epsilon 
Unknown, similar to B. 
subtilis YwzB protein 

unknown, UDP-N- 
acetylglucosamine 1- 
carboxyvinyltransferase 
unknown, similar to MreB- 
like protein 
Unknown 
Unknown, similar to 
transcriptional regulator 
(LysR family) 
Unknown, similar to 
NADH:flavin 
oxidoreductase 
Unknown, similar to 
ethanolamine ammonia- 
lyase, heavy chain 
Unknown, similar to 
shikimate 5- 
dehydrogenase 
Unknown, similar to 3- 
dehydroquinate 
dehydratase 
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SeqID n° 588 


LM-1713.1 


From 527580 


to 528488 


oeqiu n ooy 


LM-I / 14.1 


From 52o563 


to 530323 




LJvl- I / to. I 


rrom oouo id 


to ool lUo 


SeqID n° 591 


LM-1716.1 


From 531294 


to 532253 


SeqID n° 592 


LM-1717.1 


From 532328 


to 532552 


SeqID n° 593 


LM-1718.2 


From 532603 


to 5341 1 1 


SeqID n° 594 


LM-1719.1 


From 534307 


to 534756 


SeqID n° 595 


LM-1 721.1 


From 534753 


to 535421 


SeqID n° 596 


LM-1 722.1 


From 535428 


to 536078 


SeqID n° 597 


LM-1 723.1 


From 536187 


to 538247 


SeqID n° 598 


LM-1 724.1 


From 538251 


to 538853 


SeqID n° 599 


LM-1 725.1 


From 538881 


to 539348 


SeqID n° 600 
SeqID n° 601 


LM-1 726.1 
LM-1 727.1 


From 539365 
From 539774 


to 539763 
to 540424 


SeqID n° 602 


LM-1728.1 


From 540421 


to 541467 


SeqID n°603 


LM-1 729.1 


From 541483 


to 541776 


SeqID n° 604 


LM-1 73.1 


From 1202171 


to 1203592 


SeqID n° 605 


LM-1731.1 


From 541791 


to 543062 


SeqID n° 606 


LM-1733.1 


From 543202 


to 544137 


SeqID n° 607 
SeqID n° 608 


LM-1 734.1 
LM-1 735.1 


From 544923 
From 545609 


to 545501 
to 546289 


SeqID n° 609 


LM-1736.1 


From 546314 


to 546673 



Unknown, similar to 
transcriptional regulator 
(LysR family) 
Unknown, similar to 
acylase 
Unknown, weakly similar to 
esterase 
Unknown, similar to 
transmembrane protein 
Unknown, similar to B. 
subtilis YnzC protein 
Unknown, similar to sugar 

transferase 
Unknown, similar to ribose 
5-phosphate isomerase 

Unknown, similar to 
ribulose-5-phosphate 3 
epimerase 
Unknown, similar to 

transaldolase 
Unknown, similar to 
transcription antiterminator 
BgIG family 
Unknown, similar to 
putative sugar-phosphate 

isomerase 
Unknown, similar to PTS 
fructose-specific enzyme 
HA component 
unknown 
Unknown, similar to 
ribuiose-5-phosphate 3- 
epimerase 
Unknown, similar to polyol 
(sorbitol) dehydrogenase 
Unknown, similar to PTS 
system, Galactitol-specific 
MB component 
Unknown, similar to 
ethanolamine utilization 
protein EutA (putative 
chaperonin) 
Unknown, similar to PTS 
system, Galactitol-specific 
IIC component 
Unknown, similar to 
phosphoribosyl 
pyrophosphate synthetase 
Unknown 
Unknown, conserved 
hypothetical protein 
Unknown 
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SeqID n°610 
SeqID n°611 

SeqID n° 612 
SeqID n°613 



LM-1 737.1 From 546680 to 547138 

LM-174.1 From 1200625 to 1202082 

LM-1 740.1 From 549438 to 549869 

LM-1741.1 From 549916 to 551346 



SeqID n° 614 


LM-1 742.2 


From 551467 


to 552282 


SeqID n° 615 


LM-1 75.1 


From 


1200051 


to 1200632 


SeqID n°616 


LM-1 755.2 


From 


2678302 


to 2679330 


SeqID n° 617 


LM-1 757.1 


From 


2681 167 


to 2682018 


SeqID n° 618 


LM-1 758.1 


From 


2682040 


to 2682462 


SeqID n° 619 
SeqID n° 620 


LM-1 759.1 
LM-176.1 


From 
From 


2682584 
1198637 


to 2682943 
to 1199776 


SeqID n° 621 


LM-1 760.1 


From 


2683187 


to 2684056 


SeqID n° 622 
SeqID n 623 
SeqID n° 624 


LM-1 761.1 
LM-1 762.1 
LM-1 763.1 


From 
From 
From 


2684251 
2684665 
2685297 


to 2684643 
to 26851 02 
to 2686043 


SeqID n° 625 


LM-1 766.1 


From 


2686049 


to 2686846 


SeqID n° 626 


LM-1 769.1 


From 


2686849 


to 2687715 


SeqID n° 627 


LM-1 771.1 


From 


2687691 


to 2688530 


SeqID n° 628 


LM-1 772.1 


From 


2688661 


to 2689323 


SeqiD n° 629 
SeqID n° 630 
SeqID n° 631 
SeqID n° 632 
SeqID n° 633 


LM-1 773.1 
LM-1 774.1 
LM-1 775.2 
LM-1 776.3 
LM-1 777.3 


From 
From 
From 
From 
From 


2689442 
2690371 
2691673 
1441313 
1441581 


to 2690332 
to 2691465 
to 2692080 
to 1441540 
to 1442114 


SeqID n° 634 


LM-1 779.1 


From 


1443873 


to 1445042 



Unknown, weakly similar to 
transcription regulator 
unknown, similar to 
sensory transduction 

histidine kinase 
Unknown, conserved 
hypothetical protein 
Unknown, similar to 
Bacillus anthracis 
encapsulation protein 

CapA 
Unknown, similar to 
phosphoglycerate mutase 
unknown, similar to similar 
to two-component 
response regulator 
Unknown, similar to ATP 
binding proteins 
Unknown, similar to 
oxidoreductase, aldo/keto 
reductase family 
Unknown, similar to 
transcription regulators 
(MerR family) 
Unknown 
unknown, similar to 
NADPH-dependent butanol 
dehydrogenase 
Unknown, similar to 
unknown proteins 
ribosomal protein S9 
ribosomal protein L13 
unknown, highly similar to 
pseudouridylate synthase I 
Unknown, highly similar to 
B. subtilis YbaF protein 
Uknown similar to ABC 
transporter (ATP-binding 

protein) 
Unknown, similar to ABC 
transporter (ATP-binding 
protein) 
Unknown, conserved 
hypothetical protein 
Unknown 
Unknown 
ribosomal protein LI 7 
Unknown 
Unknown, modulates DNA 

topology 
Unknown, similar to Acetyl- 
CoA:acetyltransferase 
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SeqID n° 635 LM-1780.2 From 1445192 to 1446358 



SeqID n°636 
SeqID n° 637 

SeqID n° 638 

SeqID n° 639 

SeqID n° 640 

SeqID n° 641 

SeqID n°642 



LM-1781.2 From 1446383 to 1446982 

LM-1 782. 1 From 1 447052 to 1 448296 

LM-1783.1 From 1448315 to 1449583 

LM-1784.1 From 1449647 to 1450684 

LM-1 785.1 From 1450701 to 1451597 

LM-1 787.1 From 1451813 to 1452799 

LM-1 789.1 From 1452796 to 1454310 



SeqID n° 643 LM-179.1 From 1197172 to 1198047 



SeqID n° 644 
SeqID n° 645 



LM-1 790.1 From 1454347 to 1455177 
LM-1791.2 From 1455316 to 1456662 



SeqID n° 646 LM-1 794.2 From 1456775 to 1457446 



SeqID n° 647 LM-1795.1 From 1457461 to 1458387 



SeqID n° 648 LM-1 796.1 From 1458389 to 1459045 



SeqID n° 649 LM-1 798.1 From 1459049 to 1460242 



SeqID n° 650 
SeqID n° 651 



LM-1799.1 From 1460534 to 1461094 
LM-1 8.1 From 2722884 to 2723156 



SeqID n° 652 LM-1800.2 From 1461343 to 1461726 



Unknown, similar to 
hydroxy-3-methylglutaryl 
coenzyme A synthase 
Unknown 
Unknown, highly similar to 

B. subtilis YxiO protein 
Unknown, weakly similar to 
pyrophosphatase 
Unknown, conserved 
hypothetical protein 
Unknown, weakly similar to 
UDP-N-acetylglucosaminyl- 
3-enolpyruvate reductase 
Unknown, similar to glycine 
betaine/carnitine/choline 
ABC transporter 
Unknown, similar to glycine 
betaine/carnitine/choline 
ABC transporter 
(membrane protein) 
unknown, similar to 
Salmonella enterica PduX 
protein 
Unknown 
Unknown, similar to metal 
ion transport proteins 
Unknown, similar to 
betaine/carnitine/choline 
ABC transporter 
(membrane p) 
Unknown, similar to glycine 
betaine/carnitine/choline 

ABC transporter 
(osmoprotectant-binding 
protein) 
Unknown, similar to glycine 
betaine/carnitine/choline 
ABC transporter 
(membrane protein) 
Unknown, similar to glycine 
betaine/carnitine/choline 
ABC transporter (ATP- 
binding protein) 
Unknown, similar to 
unknown proteins 
Unknwon, similar to 
hypothetical PTS enzyme 
MB component 
Unknown, similar to 
unknown proteins 
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SeqID 


n° 653 


LM-1801.2 


From 


1461866 


SeqID 
SeqID 


n° 654 
n° 655 


LM-1 802.1 
LM-1 803.1 


From 
From 


1463506 
1464209 


SeqID 


n° 656 


LM-1 805.1 


From 


1465638 


SeqID 


n° 657 


LM-1 806.1 


From 


1467325 


SeqID 


n° 658 


LM-1 807.1 


From 


1468221 


SeqID 


n° 659 


LM-1 809.1 


From 


1469444 


SeqID 


n° 660 


LM-1810.1 


From 


1470685 


SeqID 
SeqID 


n° 661 
n° 662 


LM-1811.1 
LM-1813.1 


From 
From 


1472980 
1473889 


oeqiL* 


n DDO 


LIVI- lo 14. 1 


From 


I *t i 4oUZ 


SeqID 


n° 664 


LM-1815.1 


From 


1475944 


SeqID 
oeqiLJ 


n° 665 

M ODD 


LM-1816.1 

LIVI- I O I / . I 


From 
From 


1476642 
l*f/ /uou 


oeqi u 


n oo/ 


LIVI- I O I O. I 


rlOii 1 


I H I OU ( 1 


SeqID 


n° 668 


LM-1 82.1 


From 


1194890 


SeqID 
SeqID 
oeqiu 


n° 669 
n° 670 
n o/ i 


LM-1 820.1 
LM-1821.1 

LIVI- I OZO. I 


From 
From 
From 


1478477 
1479330 

I *f OUZ40 


oeqiu 


n D/ z 


LIVI- I OZO. I 


From 


i I 1 OO 


SeqID 


n° 673 


LM-1 826.1 


From 


1482096 


SeqID 


n° 674 


LM-1827.1 


From 


1483549 


SeqID 


n° 675 


LM-1 829.1 


From 


1484590 


SeqID 


n° 676 


LM-1 83.1 


From 


1194120 



to 


1463467 


Unknown, similar to ABC 
transporter (ATP-binding 
protein) 


to 


1464156 


Unknown 


to 


1465549 


Unknown, similar to 
glutathione reductase 


to 


1467305 


Unknown, similar to 
unknown proteins 


to 


1468206 


Unknown, similar to 
dihydrodipicolinate 
synthase 


to 


1469432 


Unknown, similar to 
aspartokinase I (alpha and 
beta subunits) 


to 


1470487 


Unknown, similar to 
aspartate-semialdehyde 
dehydrogenase 


to 


1472850 


unknown, similar to 
penicillin-binding protein 


to 


1473588 


superoxide dismutase 


to 


1474689 


unknown, similar to 
unknown proteins 


to 


1475908 


Unknown, similar to 
putative peptidoglycan 
acetylation protein 


to 


1476651 


Unknown, similar to 
transport proteins 


to 


1476968 


Unknown 


to 


1477934 


Unknown, similar to protein 
secretion PrsA (post- 
translocation molecular 
chaperone) 


to 


1478496 


transcriptional regulator 
ZurR (ferric uptake 
regulation) 


to 


1196083 


Unknown, similar to 
acetate kinase 


to 


1479355 


metal transport protein 


to 


1480103 


ABC transporter 


to 


1481172 


Unknown, conserved 
hypothetical protein 


to 


1482081 


unknown, similar to 
endonuclease IV 


to 


1483403 


Unknown, similar to ATP- 
dependent RNA helicase, 
DEAD-box family (deaD) 


to 


1484544 


Unknown, similar to E. coli 
LytB protein 


to 


1485711 


Unknown, conserved 
hypothetical protein 


to 


1194824 


Unknown, similar to 
glycerol uptake facilitator 
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SeqID n° 677 


LM-1 830.1 


From 1485708 


to 1486412 


SeqID n° 678 


LM-1 833.2 


From 1486480 


to 1487604 


SeqID n° 679 


LM-1 834.2 


From 289562 


to 290521 


SeqID n° 680 


LM-1 835.1 


From 289139 


to 289555 


SeqID n° 681 


LM-1 836 1 


From 287853 


to 98RQQ9 


SeqID n° 682 
SeqID n° 683 


LM-1 838.1 
LM-1 84.1 


From 286219 
From 1192974 


to 287718 
to 1194092 


SealD n° 684 
SeqID n° 685 
SealD n° 686 


I M-1R40 1 
LM-1 842.1 
LM-1 843 1 


From PfM^fi^ 

From 282755 
From 9ft1091 


to OP.P.C\-\ 1 
iu zouu j i 

to 284227 

tn 9ft9ziP1 


SealD n° 687 


LM-1 844 1 


From 280410 
n ui 1 1 ^out 1 iu 


lu zouyuj 


SealD n° 688 


LM-1 85 1 

1— 1 V 1 1 CvJ » | 


From 11Q154Q 

1 1 Ul II 1 I U 1 O'+C/ 


IU I I v7zlv7QO 


SealD n° 689 


LM-1 852 1 


From 97R72R 

t 1 UI 1 J /L I U f Z_ U 


LU Z.OUOOO 


SealD n° 690 


LM-1 854 1 

I 1 V 1 1 VJU*T. | 


From 27^00^ 


tn 27R c i ,:: i7 


SealD n° 691 


LM-1 856 2 


From 271^94 

l I ui m £- t I u^.*-r 


tn 979^09 


SeqID n° 692 


LM-1 858.1 


From 269754 


to 270257 


SeqID n° 693 
SealD n° 694 


LM-1 859.2 
LM-1 860 2 

1—lVI 1 


From 269067 
From 19Q^72 

1 1 Ul 1 1 1 UUUU / /i. 


to 269729 
to 1QQ4ft^7 


SealD n° 695 


LM-1 861 1 


From 1991047 

1 l UI 1 1 | UU lut/ 


to 1QQ^9R 


SeqID n° 696 


LM-1 862.1 


From 1989805 


to 1990812 


SeqID n° 697 


LM-1 863.1 


From 1988159 


to 1989802 


SeqID n°698 


LM-1 864.1 


From 1987365 


to 1988048 


SeqID n° 699 


LM-1 865.1 


From 1986342 


to 1987346 


SeqID n° 700 


LM-1 866.1 


From 1985218 


to 1986345' 



protein 

Unknown, conserved 
hypothetical protein 
RNA polymerase sigma 

factor RpoD 
Unknown, similar to other 
proteins 
Unknown, similar to 
transcriptional regulators 

Unknown, similar to 
succinyldiaminopimelate 
desuccinylase 
internalin E 
unknown, similar to 
NADPH-dependent butanol 
dehydrogenase 
internalin H 
internalin G 
Unknown, similar to 
phospho-beta-glucosidase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
ethanolamine utilization 

protein EutE 
RNA polymerase (beta 

subunit) 
RNA polymerase (beta 
subunit) 
Unknown, similar to 

unknown protein 
Unknown, similar to 
unknown protein 
Unknown 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
pyruvate formate-lyase 
Unknown, similar to 

peptidase 
Unknown, similar to 
malolactic enzyme (malate 
dehydrogenase) 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
(hypothetical sensory 
transduction histidine 



WO 01/77335 



PCT/FR01/01118 



79 



kinase) 



SeqiD n° 701 LM-1 867.1 From 1984056 to 1985192 Unknown, similar to 

unknown proteins 
(hypothetical sensory 
transduction histidine 
kinase) 

SeqID n° 702 LM-1 869.1 From 1982630 to 1983736 Unknown, similar to 

oxidoreductases 

SeqID n° 703 LM-1871.1 From 1981774 to 1982640 Unknown, similar to 

unknown proteins 

SeqID n° 704 LM-1 872.1 From 1981299 to 1981634 Unknown, similar to 

unknown proteins 

SeqID n° 705 LM-1 874.1 From 1980511 to 1981302 Unknown, similar to 

dihydrodipicolinate 
reductase 

SeqID n° 706 LM-1 875.1 From 1980091 to 1980495 Unknown, similar to 

methylglyoxal synthase 
SeqID n° 707 LM-1 876.1 From 1978895 to 1980076 Unknown, similar to tRNA 

CCA-adding enzyme 

SeqID n° 708 LM-1 877.1 From 1977928 to 1978905 Unknown, similar to 

transcriptional regulator 
and biotin acetyl-CoA- 
carboxylase synthetase 
SeqID n° 709 LM-1 879.1 From 1977307 to 1977780 unknown; similar to 

thioredoxin 

SeqID n° 710 LM-1 88.1 From 1190547 to 1191542 Unknown, hyghly similar to 

Salmonella enterica PduO 
protein 

SeqID n° 711 LM-1 880.1 From 1976307 to 1977140 unknown, similar to 

ketopantoate 
hydroxymethyltransferases 
SeqID n° 712 LM-1 881.1 From 1975446 to 1976303 unknown, similarto 

panthotenate synthetases 
SeqID n° 713 LM-1 882.1 From 1975059 to 1975442 unknown, similar to 

aspartate 1- 
decarboxylases 

SeqID n° 714 LM-1 884.1 From 1972176 to 1974962 unknown, similar to ATP- 

dependent helicases 

SeqID n° 715 LM-1 885.1 From 1971547 to 1972131 unknown, similar to 

hypothetical proteins 

SeqID n° 716 LM-1 886.1 From 1970343 to 1971524 unknown, similar to 

aspartate 
aminotransferases 

SeqID n° 717 LM-1 888.1 From 1969037 to 1970329 unknown, similar to 

sparaginyl-tRNA 
synthetases 

SeqID n° 718 LM-189.1 From 1190269 to 1190532 Unknown, similarto carbon 

dioxide concentrating 
mechanism protein 
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Seq!Dn°719 LM-1891.1 

SeqlDn°720 LM-1 893.1 

SeqlDn°721 LM-1894.1 

SeqlDn°722 LM-1 896.1 

SeqlD n°723 LM-1 897.2 

SeqlDn°724 LM-1 898.2 

SeqlD n° 725 LM-19.1 

SeqlD n°726 LM-190.1 

SeqlD n°727 LM-1900.1 

SeqlD n°728 LM-1904.1 

SeqlD n° 729 LM-1 905.1 

SeqlD n°730 LM-1906.1 

SeqlD n° 731 LM-1 907.1 

SeqlD n° 732 LM-1 908.1 

SeqlD n° 733 LM-1910.1 

SeqlD n° 734 LM-1 91 1.1 

SeqlD n° 735 LM-1912.1 

SeqlD n° 736 LM-1913.1 

SeqlD n° 737 LM-1915.1 

SeqlD n° 738 LM-1916.1 

SeqlD n° 739 LM-1918.2 

SeqlD n° 740 LM-1 91 9.2 



80 

From 1968169 to 1968888 

From 1967499 to 1968158 

From 1 967015 to 1 967506 

From 1964489 to 1966972 

From 1963852 to 1964457 

From 2208576 to 221 0351 

From 27231 59 to 2723593 

From 1 1 89785 to 1 1 90264 

From 22071 03 to 2208362 

From 2205709 to 2207016 

From 2204857 to 2205708 

From 2203996 to 2204829 

From 220 1719 to 2203980 

From 2200723 to 220 1 544 

From 21 99365 to 2200723 

From 2197763 to 2199115 

From 2197267 to 2197728 

From 2196382 to 2197206 

From 2 1 94399 to 2 1 96339 

From 2193645 to 2194412 

From 2192693 to 2193448 

From 2191525 to 2192256 



unknown, similar to 
chromosome replication 
initiation protein 
unknown, probable 
endonuclease III (DNA 
repair) 
unknown 
unknown, similar to 
penicillin-binding protein 2A 

unknown, similar to DNA 
repair and homologous 
recombination protein 
Unknown, similar to 
maltogenic amylase 
Unknwon, similar to 
mannitol-specific PTS 
enzyme IIA component 
Unknown 
Unknown, similar to 
maltose/maltodextrin- 

binding protein 
Unknown, similar to 
maltodextrin transport 
system permease 
Unknown, similar to 
maltodextrin transport 
system permease 

Unknown, similar to 
maltodextrose utilization 
protein MalA 
Unknown, similar to 
maltosephosphorylase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
phosphoglucomutase 
Unknown, similar to 
unknown proteins 
unknown 
Unknown, similar to ABC 
transporter (permease) 
Unknown, similar to ABC 
transporter (ATP-binding 
protein) 
unknown, similar to 
unknown proteins 
Unknown, similar to FMN- 
containing NADPH-linked 
nitro/flavin reductase 
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SeqlDn°741 LM-1 92.1 

SeqiD n°742 LM-1 920.1 

SeqID n° 743 LM-1921.1 

SeqID n° 744 LM-1 922.1 

SeqID n° 745 LM-1923.1 

SeqID n°746 LM-1924.1 

SeqID n° 747 LM-1 927.1 

SeqID n° 748 LM-1 930.1 

SeqID n° 749 LM-1 932.1 

SeqID n° 750 LM-1 933.2 

SeqID n° 751 LM-1935.1 

SeqID n° 752 LM-1936.1 

SeqID n°753 LM-1938.1 

SeqID n°754 LM-1942.1 

SeqID n°755 LM-1944.1 

SeqID n°756 LM-1945.1 

SeqID n°757 LM-1 946.1 

SeqID n° 758 LM-1 947.1 

SeqID n°759 LM-1 948.1 

SeqID n° 760 LM-1949.1 

SeqID n° 761 LM-195.1 

SeqID n° 762 LM-1950.1 



81 

From 1 1 88949 to 1 1 89788 

From 2190489 to 2191445 

From 2189605 to 2190402 

From 2188430 to 2189563 

From 2187663 to 2188424 

From 2186557 to 2187411 

From 2184344 to 2186338 

From 2182823 to 2183800 

From 2182218 to 2182784 

From 2181329 to 2182216 

From 1610206 to 1611165 

From 1611 449 to 1 61 2405 

From 1 61 2395 to 1 61 3279 

From 1613497 to 1616823 

From 1616945 to 1617880 

From 1617901 to 1619214 

From 1619240 to 1619926 

From 1 620091 to 1 62 1 1 88 

From 1621230 to 1622342 

From 1622583 to 1623047 

From 1 188293 to 1 1 88928 

From 1623099 to 1624292 



Unknown, similar to 
ethanolamine utilization 
protein EutJ 
Unknown, similar to 
mannnose-6 phospate 
isomelase 
Unknown, similar to 
hydrolase 
Unknown, similar to N- 
acetylglucosamine-6- 
phosphate deacetylase 

Unknown, similar to 
transcriptional regulator 
(DeoR family) 
Unknown, similar to 
unknown proteins 
Unknown, similar to ferrous 
iron transport protein B 

Unknown, similar to 
phosphotransacetylase 

Unknown 
Unknown, similar to a 
protein required for 
pyridoxine synthesis 
unknown, highly similar to 

6-phosphofructokinase 
unknown, highly similar to 
acetyl CoA carboxylase 

(alpha subunit) 
unknown, highly similar to 
acetyl-CoA carboxylase 

beta subunit 
unknown, highly similar to 
DNA polymerase III (alpha 
subunit) DnaE 
Unknown, similar to 
unknown proteins 
Unknown, simiiar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to X-Pro 

dipeptidase 
Unknown, similar to alanine 
dehydrogenase 
Unknown, similar to 

unknown protein 
unknown, similar to 
Salmonella enterica PduL 

protein 
unknown, highly similar to 
acetate kinase 
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SeqID n° 763 LM-1 951.1 From 1624316 to 1625314 

SeqID n° 764 LM-1 952.1 From 1625505 to 1626002 

SeqID n° 765 LM-1953.1 From 1626064 to 1626600 

SeqID n° 766 LM-1 954.1 From 1626620 to 1627633 

SeqID n° 767 LM-1955.1 From 1627812 to 1628615 

SeqID n° 768 LM-1 957.1 From 1628647 to 1629591 

SeqID n° 769 LM-1958.1 From 1629594 to 1630754 

SeqID n° 770 LM-1 959.1 From 1630751 to 1631503 

SeqID n° 771 LM-1960.1 From 1631516 to 1632712 

SeqID n° 772 LM-1 961. 2 From 1632728 to 1633759 

SeqID n° 773 LM-1962.2 From 1633933 to 1635144 

SeqID n° 774 LM-1 964.1 From 1635146 to 1636285 

SeqID n° 775 LM-1966.2 From 1636412 to 1638127 

SeqID n° 776 LM-1968.2 From 1308351 to 1310318 

SeqID n° 777 LM-1 97.1 From 1 187552 to 1 187989 

SeqID n° 778 LM-1970.1 From 1310315 to 1312774 

SeqID n° 779 LM-1971.1 From 1312857 to 1313324 

SeqID n° 780 LM-1 976.1 From 1317596 to 1319464 



Unknown, weakly similar to 
site specific DNA- 
methyltransferase 
Unknown, similar to thiol 
peroxidases 
Unknown, similar to 
unknown proteins 
Unknown, similar to 

proteases 
Unknown, similar to 
unknown proteins 
unknown, highly similar to 
ornithine 
carbamoyltransferase 
unknown, highly similar to 
N-acetylornithine 
aminotransferase 
unknown, highly similar to 
N-acetylglutamate 5- 
phosphotransferase 
unknown, highly similar to 
ornithine acetyltransferase 
and amino-acid 
acetyltransferases 
unknown, similar to N- 
acetylglutamate gamma- 
semialdehyde 
dehydrogenases 
Unknown, similar to thiamin 
biosynthesis protein Thil 
Unknown, similar to iron- 
sulfur cofactor synthesis 

protein nifS 
Unknown, similar to B. 
subtilis negative regulator 
of FtsZ ring formation 
(EzrA) 

unknown, highly similar to 
DNA gyrase-like protein 
(subunit B) 
unknown, similar to 
Salmonella enterica PduK 

protein 
unknown, highly similar to 
DNA gyrase-like protein 
(subunit A) 
unknown, conserved 
hypothetical protein 
unknown; similar to 
acyltransferase (to B. 
subtilis YrhL protein) 
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SeqlDn°781 LM-1 978.1 From 1319677 to 1320375 

SeqID n° 782 LM-1 979.1 From 1320608 to 1322284 

SeqID n° 783 LM-1 98.1 From 1187192 to 1187539 

SeqID n° 784 LM-1 980.1 From 1322410 to 1323327 

SeqID n° 785 LM-1 981.1 From 1323450 to 1323683 

SeqID n° 786 LM-1 983.1 From 1323794 to 1325017 

SeqID n° 787 LM-1984.1 From 1325010 to 1326236 

SeqID n° 788 LM-1985.1 From 1326440 to 1326808 

SeqID n° 789 LM-1986.1 From 1326879 to 1328213 

SeqID n° 790 LM-1 988.1 From 1328357 to 1329652 

SeqID n° 791 LM-1989.1 From 1329696 to 1330217 

SeqID n° 792 LM-1991.2 From 1330247 to 1330861 

SeqID n° 793 LM-1992.2 From 1331018 to 1331347 

SeqID n° 794 LM-1995.1 From 1331813 to 1333807 

SeqID n° 795 LM-1 996.1 From 1334028 to 1334267 

SeqID n° 796 LM-1 997.1 From 1334318 to 1335160 

SeqID n° 797 LM-1 998.1 From 1335179 to 1335910 

SeqID n° 798 LM-1 999.1 From 1335932 to 1336438 

SeqID n° 799 LM-200.1 From 1185375 to 1187195 

SeqID n° 800 LM-2000.1 From 1336448 to 1337752 

SeqID n° 801 LM-2001.1 From 1337742 to 1338947 



unknown similar to 
glycerophosphodiester 
phosphodiesterase 
Unknown, similar to 
glycerol 3 phosphate 
dehydrogenase 
unknown, similar to diol 
dehydratase-reactivating 

factor small chain 
Unknown, similar to tRNA 
isopentenylpyrophosphate 

transferase 
Unknown, similar to host 
factor-1 protein 
unknown, conserved 
hypothetical protein similar 
to B. subtilis YnbA protein 
unknown, similar to 
aluminum resistance 
protein and to B. subtilis 
YnbB protein 
Unknown, similar to 
glutamine synthetase 
repressor 
unknown, highly similar to 

glutamine synthetases 
Unknown, similar to arsenic 
efflux pump protein 
unknown, conserved 
hypothetical protein 
unknown, highly similar to 
SOS response regulator 
lexA, transcription 
repressor protein 
unknown, similar to B. 
subtilis YneA protein 
unknown, highly similar to 

transketolase 
unknown, highly similar to 
B. subtilis YneF protein 
unknown 
unknown, weakly similar to 
arginine N- 
methyltransferases 
unknown, similar to E. coii 
YbdM protein 
diol dehydratase- 
reactivating factor large 
subunit 
unknown, similar to E. coli 
YbdN protein 
unknown 



WO 01/77335 



PCT/FR01/01118 



84 



SeqID n° 802 LM-2002.1 From 1338925 to 1339299 

SeqID n° 803 LM-2003.1 From 1339592 to 1340320 

SeqID n° 804 LM-2004.1 From 1340320 to 1340877 

SeqID n° 805 LM-2005.2 From 1341107 to 1341865 

SeqID n° 806 LM-2006.2 From 290593 to 291228 

SeqID n° 807 LM-2008.1 From 291312 to 293198 

SeqID n° 808 LM-2009.1 From 293212 to 293841 

SeqID n° 809 LM-201.1 From 1184818 to 1185330 

SeqID n° 810 LM-2010.1 From 293845 to 295281 

SeqID n° 811 LM-2011.1 From 295401 to 296213 



SeqID n° 812 LM-2012.1 From 296349 to 296852 
SeqID n° 813 LM-2013.1 From 296889 to 297551 
SeqID n° 814 LM-2014.1 From 297810 to 298652 



SeqID n° 815 LM-2015.1 From 298669 to 299490 

SeqID n° 816 LM-2017.1 From 299607 to 300578 

SeqID n° 817 LM-2018.1 From 300617 to 301717 

SeqID n° 818 LM-2019.1 From 302008 to 304140 

SeqID n° 819 LM-202.1 From 1184142 to 1184801 

SeqID n° 820 LM-2020. 1 From 304133 to 304684 



SeqID n° 821 LM-2022.1 From 304942 to 305613 
SeqID n° 822 LM-2023.1 From 305775 to 306554 

SeqID n° 823 LM-2024.1 From 306644 to 307306 



Unknown 
unknown, highly similar to 

uridylate kinases 
unknown, highly similar to 
ribosome recycling factors 

Unknown, similar to 
undecaprenyl diphosphate 
synthase 
Unknown, similar to 
phosphoglycerate mutase 
Unknown, similar to 
transporter 
Unknown 
Unknown, similar to diol 

dehydrase (diol 
dehydratase) gamma 
subunit (pddC) 
unknown, highly similar to 
phospho-beta-glucosidase 

Unknown, conserved 
hypothetical protein similar 
to B. subtilis YxeH protein 
Unknown 
Unknown 
Unknown, C-terminal part 
similar to B. subtilis 

ComEC protein 
Unknown, conserved 
hypothetical protein 
Unknown, similar to 
oxidoreductase 
Unknown, similar to sugar 
ABC transporter, ATP- 
binding protein 
Unknown, highly similar to 
anaerobic ribonucleoside- 
triphosphate reductase 
Unknown, similar to diol 

dehydrase (diol 
dehydratase) gamma 
subunit 
Unknown, highly similar to 
anaerobic ribonucleotide 
reductase activator protein 
Unknown 
Unknown, conserved 
hypothetical protein 
Unknown, similar to ABC 
transporter permease 
protein 
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SeqID n° 824 LM-2026.1 From 307303 to 308319 

SeqID n° 825 LM-2027.2 From 308334 to 309155 

SeqID n° 826 LM-203.1 From 1 182440 to 1 184104 

SeqID n° 827 LM-2035.1 From 2671624 to 267251 1 

SeqiD n° 828 LM-2037.1 From 2670042 to 2671523 

SeqID n° 829 LM-2038.1 From 2669152 to 2669994 

SeqID n° 830 LM-2039.1 From 2665666 to 2668653 

SeqID n° 831 LM-204.1 From 1181720 to 1182421 

SeqID n° 832 LM-2040.1 From 26651 90 to 2665666 

SeqID n° 833 LM-2042.1 From 2664343 to 2665128 

SeqID n° 834 LM-2043.1 From 2663619 to 2664296 

SeqID n° 835 LM-2044.1 From 2662243 to 2663622 

SeqID n° 836 LM-2045.1 From 2661076 to 2662164 

SeqID n° 837 LM-2046.1 From 2660408 to 2661076 

SeqID n° 838 LM-2047.1 From 2659852 to 2660145 

SeqID n° 839 LM-2048.2 From 2658966 to 2659841 

SeqID n° 840 LM-2049.3 From 403743 to 404198 

SeqID n° 841 LM-205.1 From 1181338 to 1181625 



SeqID n° 842 LM-2050.1 From 403279 to 403725 
SeqID n° 843 LM-2051 A From 402641 to 403060 
SeqID n° 844 LM-2053.1 From 401619 to 402539 



SeqID n° 845 LM-2054.1 From 400997 to 401299 



Unknown, similar to ABC 
transporter (ATP-binding 
protein) 
Unknown, putative 
lipoprotein 
unknown, highly similar to 
propanediol dehydratase, 
alpha subunit 
Unknwon, similar to 
transcription regulator 
TetR/AcrR family 
Unknwon, similar to drug- 
export proteins 
Unknwon, conserved 
hypothetical proteins 
Unknwon, similar to 
formate dehydrogenase 
alpha chain 
unknown, similar to 
Salmonella typhimurium 

PduB protein 
Unknown, similar to B. 
subtilis YrhD protein 
Unknwon, similar to 
formate dehydrogenase 

associated protein 
unknown, similar to two- 
component response 

regulator 
Unknwon, similar to 
response regulator 

histidine kinase 
Unknown, conserved 
hypothetical protein 
Unknwon, similar to ABC 
transporter, ATP-binding 
protein 
Unknwon, conserved 
hypothetical protein 
Unknwon 
Unknown 
unknown, similar to 
Salmonella typhimurium 
PduA protein 
Unknown 
Unknown 
Unknown, similar to 
putative transcription 
regulator 
Unknown, similar to PTS 
betaglucoside-specific 
enzyme II B component 
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SeqID n° 846 LM-2055.1 From 399645 to 400979 

SeqID n° 847 LM-2056.1 From 398210 to 399652 
SeqID n° 848 LM-2058.1 From 397340 to 398053 

SeqID n° 849 LM-2059.1 From 396964 to 397299 
SeqID n° 850 LM-2060.1 From 396208 to 396927 

SeqID n° 851 LM-2061 .1 From 395602 to 3961 1 1 

SeqID n° 852 LM-2063.1 From 394264 to 395529 

SeqID n° 853 LM-2064.1 From 393086 to 394246 

SeqID n° 854 LM-2065.1 From 391604 to 393052 

SeqID n° 855 LM-2066.1 From 390529 to 391473 

SeqID n° 856 LM-2067.1 From 389813 to 390430 

SeqID n° 857 LM-2068.1 From 388807 to 389541 

SeqID n° 858 LM-2069.1 From 387697 to 388467 

SeqID n° 859 LM-2070. 1 From 386780 to 387640 

SeqID n° 860 LM-2072.1 From 385386 to 386780 

SeqID n° 861 LM-2073.1 From 384926 to 385372 

SeqID n° 862 LM-2074.1 From 383710 to 384726 

SeqID n° 863 LM-2076.1 From 382016 to 383536 



Unknown, similar to PTS 
betaglucoside-specific 
enzyme IIC component 
Unknown, similar to beta- 
glucosidase 
Unknown, similar to 
transcription regulator 

(GntR family) 
Unknown, conserved 
hypothetical protein 
Unknown, conserved 
hypothetical protein, highly 
similar to B. subtilis Yeel 
protein 
Unknown, similar to 

different proteins 
Unknown, conserved 
hypothetical protein similar 
to B. subtilis YwbN protein 
Unknown, conserved 
hypothetical protein, 
putative lippoprotein 
Unknown, similar to 
conserved hypothetical 
protein 
Unknown, similar to 
transcription regulator 
Unknown, similar to 
Salmonella typhimurium 
peptidase E 
Unknown, similar to 
conserved hypothetical 
integral membrane protein 
Unknown, similar to 
transcriptional regulator 

(DeoR family) 
Unknown, similar to D- 
fructose-1 ,6-biphosphate 

aldolase 
Unknown, similar to PTS 
system, fructose-specific 
enzyme IIBC component 
Unknown, similar to PTS 
system, enzyme IIA 

component 
Unknown, similar to 

oxidoreductase 
Unknown, similar to 
Flavocytochrome C 
Fumarate Reductase chain 
A 
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SeqID n° 864 LM-2077.1 From 380253 to 381779 

SeqID n° 865 LM-2078.1 From 379794 to 380207 

SeqID n° 866 LM-2080.2 From 378964 to 379728 

SeqID n° 867 LM-2082.3 From 2052233 to 2052664 

SeqID n° 868 LM-2083.1 From 2052816 to 2053301 

SeqID n° 869 LM-2084.1 From 2053274 to 2053801 

SeqID n° 870 LM-2088.1 From 2054493 to 2056187 

SeqID n° 871 LM-2089.1 From 2056205 to 2057926 

SeqID n° 872 LM-2091.1 From 2057927 to 2058418 

SeqID n° 873 LM-2092.1 From 2058516 to 2059511 

SeqID n° 874 LM-2097.1 From 2059669 to 2061207 

SeqID n° 875 LM-2098.1 From 2061209 to 2062261 

SeqID n° 876 LM-21 A From 2723786 to 2725852 

SeqID n° 877 LM-2100.1 From 2062263 to 2063651 

SeqID n° 878 LM-2101.1 From 2063638 to 2064219 

SeqID n° 879 LM-2102.1 From 2064238 to 2065506 

SeqID n° 880 LM-2104.1 From 2065698 to 2066417 

SeqID n° 881 LM-21 06.1 From 2066455 to 2067756 

SeqID n° 882 LM-2107.3 From 2067887 to 2068897 



Unknown, similar to fatty- 
acid-CoA ligase 
Unknown, similar to 
unknown proteins 
unknown, highly similar to 
regulatory proteins (DeoR 
family) 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
dihydroxy-acid dehydratase 
Unknown, similar to 
acetolactate synthase 
(acetohydroxy-acid 
synthase) (large subunit) 
Unknown, similar to 
acetolactate synthase 
(acetohydroxy-acid 
synthase) (small subunit) 
Unknown, similar to ketol- 
acid reductoisomerase 
(acetohydroxy-acid 
isomeroreductase) 
Unknown, similar to 2- 
isopropylmalate synthase 
Unknown, similar to 3- 
isopropylmalate 
dehydrogenase 
Unknown, similar to 
transcriptional 
antiterminator 
Unknown, similar to 3- 
isopropylmaiate 
dehydratase (large subunit) 
Unknown, similar to 3- 
isopropylmalate 
dehydratase (small 

subunit) 
Unknown, similar to 
threonine dehydratase 
Unknown, similar to alpha- 
acetolactate decarboxylase 
Unknown, similar to 
pyrimidine-nucleoside 

phosphorylase 
Unknown, similar to 
transcription regulators 
(Lacl family) 
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SeqID n° 883 LM-2109.2 From 1677409 to 1680009 



SeqlDn°884 LM-2110.1 From 1680132 
SeqID n° 885 LM-21 11.1 From 1680644 



SeqID n° 886 

SeqID n° 887 

SeqID n° 888 

SeqID n° 889 
SeqID n° 890 



LM-21 13.1 From 1681561 

LM-2 1 1 4. 1 From 1 682664 

LM-2115.1 From 1683635 

LM-2116.1 From 1684229 

LM-21 19.1 From 1684563 



to 1680560 
to 1681564 

to 1682628 

to 1683638 

to 1684216 

to 1684462 
to 1687265 



SeqID n° 891 LM-2120.1 From 1687420 to 1688223 



SeqID n° 892 LM-2121.1 From 1688248 to 1688670 
SeqID n° 893 LM-2123.1 From 1688670 to 1691888 

SeqID n° 894 LM-2127.1 From 1691972 to 1695043 



SeqID n° 895 LM-2128.1 From 1695040 



SeqID n° 896 LM-2129.1 From 1696280 



SeqID n° 897 LM-2130.1 From 1696940 

SeqID n° 898 LM-2131.1 From 1697476 

SeqID n° 899 LM-2133.1 From 1698165 

SeqID n° 900 LM-2134.1 From 1698734 



SeqID n° 901 LM-2135.2 From 1700523 



SeqID n° 902 
SeqID n° 903 
SeqID n° 904 
SeqID n° 905 
SeqID n° 906 
SeqID n° 907 

SeqID n° 908 
SeqID n° 909 



LM-2 138.1 
LM-2 14.1 
LM-2 140.1 
LM-2141.1 
LM-2 142.1 
LM-2143.1 



From 366508 
From 1177116 
From 367533 
From 367898 
From 369377 
From 370032 



LM-21 45.1 From 370732 
LM-2147.1 From 371130 



to 1696164 
to 1696888 

to 1697302 
to 1697991 
to 1698653 

to 1700539 
to 1702292 



to 367032 
to 1177865 
to 367901 
to 369373 
to 369757 
to 370403 

to 371064 
to 373127 



Unknown, similar to 
Alcohol-acetaldehyde 

dehydrogenase 
Unknown, similar to 
unknown proteins 
Unknown, similar to similar 
to ABC transporter (ATP- 
binding protein) 
Unknown, similar to 
membrane proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to dna-3- 
methyladenine glycosidase 

Unknown 
unknown, highly similar to 
aconitate hydratases 
Unknown, similar to 
putative sigma factor 
regulator 
Unknown 
Unknown, similar to SNF2- 

type helicase 
Unknown, similar to ATP- 
dependent dsDNA 
exonuclease SbcC 
unknown, similar to 
putative exonucleases 
SbcD 

unknown, similar to 1- 
acylglycerol-3-phosphate 
O-acyltransferases 
unknown 
unknown 
unknown, similar to 
hypothetical proteins 
unknown, similar to ABC 
transporter (ATP-binding 

protein) 
unknown, similar to ABC 
transporter (ATP-binding 
protein) 
unknown 
unknown 
Unknown 
Unknown 
Unknown 
Unknown, weakly similar to 
inorganic pyrophosphatase 
Unknown 
Unknown, similar to 
transketolase 
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SeqID n° 910 


LM-2148 1 


"ILJllI Of O 


to o73785 


SeqID n° 911 


LM-PI'S 1 


rivjlil l I / DODU 


to 11 / 7091 


SeqID n° 912 


LM-2 150.1 


From 373832 


to 374596 


SeqID n° 913 


LM-2151.1 


From 374621 


to 375067 


SeqID n° 914 


LM-2 152.1 


From 375074 


to 375838 


SeqID n° 915 


LM-2 154.1 


From 375842 


to 376492 


SeqID n° 916 


LM-2 155.1 


From 376514 


to 377509 


SeqID n° 917 
SeqID n° 918 
SeqID n° 919 


LM-2 156 1 
LM-2 157.2 
LM-2 158.2 


From "3 77*^1 
i i \j\ no// 1 

From 377888 
From 378404 


\r\ 777Q70 
IO Of fO/Z 

to 378319 
to 378781 


SeqID n° 920 


LM-2 16.1 


From 1176304 


to 1176654 


SeqID n° 921 


LM-2161.2 


From 72408 


to 74222 


SeqID n° 922 
SeqID n° 923 
SeqID n° 924 
SeqID n° 925 
SeqID n° 926 


LM-2 162.1 
LM-2 163.1 
LM-2 164.1 
LM-2 166.1 
LM-2 17.1 


From 72062 
From 71360 
From 71064 
From 70676 
From 1175602 


to 72394 
to 72061 
to 71363 
to 71071 
to 1176156 



SeqID n° 


927 


LM-2 170.1 


From 66160 


to 70656 


SeqID n° 


928 


LM-2171.1 


From 64950 


to 66146 


SeqID n° 


929 


LM-2172.1 


From 64677 


to 64928 


SeqID n° 
SeqID n° 


930 
931 


LM-2174.1 
LM-2 178.1 


From 64144 
From 60948 


to 64659 
to 64154 


SeqID n° 


932 


LM-2179.1 


From 60506 


to 60799 



SeqID n° 933 LM-218.1 From 1174241 to 1175605 
SeqID n° 934 LM-2180.2 From 58897 to 60189 
SeqID n° 935 LM-2182.2 From 1814403 to 1815080 



Unknown, similar to 

transaldolase 
unknown, similar to 
Salmonella enterica PduV 
protein 
Unknown, similar to 
dehydrogenase/reductase 
Unknown, similar to sugar- 
phosphate isomerase 
Unknown, similar to 
triosephosphate isomerase 

Unknown, similar to 
dihydroxyacetone kinase 

Unknown, similar to 
dihydroxyacetone kinase 
Unknown 
Unknown 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
Salmonelle enterica PduU 

protein 
Unknwon, similar to toxin 
components 
Unknwon 
Unknwon 
Unknwon 
unknwon 
Unknown, similar to 
Salmonella enterica PduT 

protein 
Unknwon, highly similar to 
B. subtilis YukA protein 
Unknwon, similar to B. 
subtilis YukC protein 
Unknwon, similar to B. 
subtilis YukD protein 

Unknwon 
Unknwon, similar to B. 
subtilis YueB protein 
Unknwon, similar to a small 

heat shock protein of 
Clostridium acetobutylicum 

unknown, similar to 
Salmonella enterica PduS 

protein 
unknown, highly similar to 
adenylosuccinate 
synthetase 
Unknown, similar to two- 
component response 
regulator 
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SeqID n° 936 LM-2 184.1 From 1813457 to 1814332 

SeqID n° 937 LM-2185.1 From 1812978 to 1813436 

SeqID n° 938 LM-2186.1 From 1811219 to 1812961 

SeqID n° 939 LM-2188.1 From 1810157 to 1811197 



SeqID n° 940 LM-2189.1 From 1809118 to 1809759 

SeqID n° 941 LM-2191.1 From 1808460 to 1809107 

SeqID n° 942 LM-2 193.1 From 1807643 to 1808458 

SeqID n° 943 LM-2194.1 From 1806446 to 1807552 

SeqID n° 944 LM-2195.1 From 1805925 to 1806443 

SeqID n° 945 LM-2 198.1 From 1805041 to 1805928 

SeqID n° 946 LM-2 199.1 From 1800256 to 1804848 

SeqID n° 947 LM-22. 1 From 2726008 to 27271 95 

SeqID n° 948 LM-220.1 From 1 173643 to 1 174122 

SeqID n° 949 LM-2201.1 From 1798772 to 1800241 

SeqID n° 950 LM-2203.1 From 1797897 to 1798727 

SeqID n° 951 LM-2204.1 From 1796987 to 1797880 

SeqID n° 952 LM-2205.2 From 1795681 to 1796925 



SeqID n° 953 
SeqID n° 954 
SeqID n° 955 
SeqID n° 956 
SeqID n° 957 
SeqID n° 958 



LM-2207.2 
LM-2209.1 
LM-221.1 
LM-22 10.1 
LM-22 12.1 
LM-22 13.1 



From 
From 
From 
From 
From 
From 



2505425 
2505858 
1173144 
2506494 
2506867 
2507306 



to 2505835 
to 2506370 
to 1173551 
to 2506835 
to 2507250 
to 2508202 



SeqID n° 959 LM-2214.1 From 2508247 to 2508816 



Unknown, similar to 
unknown proteins 
Unknown 
unknown, highly similar to 

adenine deaminases 
Unknown, simitar to two- 
component sensor histidine 
kinase 

Unknown, similar to amino 
acid (glutamine) ABC 
transporter, permease 
protein 
Unknown, similar to amino 

acid (glutamine) ABC 
transporter (ATP-binding 

protein) 
Unknown, similar to amino 
acid ABC transporter 

(binding protein) 
Unknown, similar to 
glycerol dehydrogenase 
Unknown, similar to 
unknown proteins 
transcription activator of 
glutamate synthase operon 
GltC 

Unknown, similar to 
glutamate synthase (large 

subunit) 
unknown, highly similar to 
translation elongation 
, factor EF-Tu 
unknown, similar to 
uroporphyrin-lll C- 
methyltransferase 
Unknown, similar to 
glutamate synthase (small 

subunit) 
Unknown, similar to sugar 

transport protein 
Unknown, similar to sugar 

transport protein 
Unknown, similar to sugar 
binding protein 
Unknown 
unknown 
unknown 
Unknown 
unknown 
Unknown, similar to 
transcription regulator 
Unknown 
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oeqiu n c>ou 
SeqID n° 961 


I R/r OOACZ A 
\-\v\-ZZ\D. \ 

LM-2216.1 


rrom ^ouoyy^: 
From 2510361 


to 250941 1 
to 2514293 


SeqID n° 962 


LM-2217.1 


From 2514308 


to 2515210 


SeqID n° 963 


LM-2219.1 


From 2515312 


to 2518587 


SeqID n° 964 


LM-2220.1 


From 2518971 


to 2519879 


SeqID n° 965 


LM-2221.1 


From 2519935 


to 2520399 


SeqID n° 966 


LM-2222.1 


From 2520415 


to 2522796 


SeqID n° 967 


LM-2224.2 


From 2522830 


to 2523576 


SeqID n° 968 


LM-2225.2 


From 2343056 


to 2343685 


SeqID n° 969 


LM-2226.1 


From 2341865 


to 2343013 


SeqID n° 970 


LM-2227.1 


From 2341064 


to 2341789 


Cionin n° Q*71 


I WA 999A A 


rrom zooyo i i 


TO Zo4 lU / \ 


SeqID n° 972 


LM-2229.1 


From 2338420 


to 2339418 


C^pn in n° Q7^ 

OCLjlL/ II <3 / O 

SeqID n° 974 


LM-2230.1 


From 1 1*79*707 

From 2337786 


to 2338406 


oeqiu n y / d 




Cmrv-i OOQCJC A Pi 

rrom zoooolo 


to Zoofoyy 


SeqID n° 976 


LM-2233.1 


From 2335862 


to 2336518 


beqlD n 977 


LM-2234.1 


From 2335468 


to 2335860 


SeqID n° 978 


LM-2235.1 


From 2334586 


to 2335455 


SeqID n° 979 


LM-2236.1 


From 2334003 


to 2334569 


SeqID n° 980 


LM-2237.1 


From 2333375 


to 2333857 


SeqID n° 981 


LM-2238.1 


From 2332888 


to 2333283 



unknown 
Unknown, similar to 

glycosidase 
Unknown, similar to 

internalin 
Unknown, similar to 

glycosidase 
Unknown, similar to 
transcription regulator 
Unknown, conserved 
hypothetical protein 
Unknwon, similar to 
exoribonuclease RNase-R 
Unknown, similar to 

carboxylesterase 
Unknown, similar to 
phosphoglucomutase 
Unknown, similar to 
aspartate aminotransferase 
Unknown, similar to amino 
acid ABC transporter (ATP- 

binding protein) 
Unknown, similar to amino 
acid ABC transporter, 
permease protein 
Unknown, similar to low- 
affinity inorganic phosphate 
transporter 
unknown 
Unknown, similar to 
unknown proteins 
Unknown, similar to 

oxidoreductase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
putative ribosomal large 
subunit pseudouridine 
synthase 
Unknown, similar to 
methylphosphotriester- 
DNA alkyltransferase and 
transcriptional regulator 
Unknown, similar to 06- 
methylguanine-DNA 
methyltransferase 
Unknown, similar to 
transcriptional regulators 
(GntR family) 
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SeqID n° 982 
SeqID n" 983 



SeqID n° 984 
SeqID n° 985 



LM-2239.1 From 2332020 to 2332898 
LM-224. 1 From 1 1 72076 to 1 1 72648 



LM-2241.1 From 2330959 to 2332023 
LM-2242.1 From 2329731 to 2330921 



SenlD n° 986 


l_l VI Zl^.M''-T . | 


Prnm O'ZOR'Zfc'Z 
riUITI ZOziOOOo 


to zozyo 1 \ 


SealD n° 987 


1— IVI 1 


Frnm 9^97/lRi 


+<-> 0'59P'5'3n 
IO AOjLOOCAJ 


SealD n° 988 






tO 1 QMZ. 


SealD n° 98Q 
SeqID n° 990 


1 M-??^ 1 
1 — IV I ^L^LvJ . 1 

LM-2250.1 


Prom 1*1"7iftf^9 

ri urn i i / loo^i 
From 2324554 


xo n/ zuuy 
to 2325408 


SeqID n° 991 


LM-2251.1 


From 2323375 


to 2324253 


SeqID n° 992 


LM-2253.1 


From 2322031 


to 2323335 


SeqID n° 993 


LM-2254.2 


From 2320906 


to 2321775 


SeqID n° 994 


LM-2255.2 


From 2320416 


to 2320841 


SeqID n° 995 
SeqID n° 996 


LM-2256.3 
LM-2257.3 


From 1405578 
From 1404714 


to 1405859 
to 1405541 


SeqID n° 997 


LM-2258.3 


From 1403227 


to 1404678 


SeqID n° 998 


LM-2260.1 


From 1402550 


to 1403230 


SeqID n° 999 


LM-2261.2 


From 1400941 


to 1402359 



SeqID n° 1000 
SeqID n° 1001 



LM-2262.2 From 1399699 to 1400796 
LM-2263.1 From 1398077 to 1399327 



SeqID n° 1002 LM-2266.1 From 1397063 to 1398046 



Unknown, similar to ABC 
transporter (ATP-binding 

protein) 
Unknown, similar to ATP- 
dependent Clp protease 
proteolytic component 
Unknown 
Unknown, similar to 
transport system permease 
protein 
Unknown, similar to 
transport system permease 
protein 
Unknown, similar to 
oxidoreductase 
Unknown, similar to NADH 
oxidase 
unknwon 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
transcriptional regulators 
(LysR family) 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
arsenate reductase 
Unknown 
Unknown, similar to B. 
subtilis Spoil IJ protein 
two-component sensor 
histidine kinase 
two-component response 
regulator 
Unknown, similar to 6- 
phosphogluconate 

dehydrogenase 
Unknown, similar to 
aminotripeptidase 
Unknown, similar to 
branched-chain alpha-keto 
acid dehydrogenase E2 
subunit (lipoamide 
acyltransferase) 
Unknown, similar to 
branched-chain alpha-keto 
acid dehydrogenase E1 
subunit (2-oxoisovalerate 
dehydrogenase beta 
subunit) 
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SeqID n° 1003 LM-2267.1 From 1396051 to 1397046 



SeqID n° 1004 LM-2269.1 From 1394599 to 1396026 

SeqID n° 1005 LM-2270.1 From 1393517 to 1394584 

SeqID n° 1006 LM-2271.1 From 1392514 to 1393380 

SeqID n° 1007 LM-2273.1 From 1390694 to 1392385 

SeqID n° 1008 LM-2274.1 From 1390222 to 1390671 

SeqID n° 1009 LM-2275.1 From 1389207 to 1390031 

SeqID n° 1010 LM-2276.1 From 1387396 to 1389210 

SeqID n° 1011 LM-2278.1 From 1385934 to 1386815 

SeqID n° 1012 LM-2280.1 From 1384362 to 1385714 

SeqID n° 1013 LM-2283.1 From 1383489 to 1384343 



SeqID n° 1014 LM-2284.1 From 1383007 to 1383393 

SeqID n° 1015 LM-2285.1 From 1382569 to 1382976 

SeqID n° 1016 LM-2287.3 From 1381201 to 1382565 

SeqID n° 1017 LM-2289.3 From 1380720 to 1381187 

SeqID n° 1018 LM-2290.3 From 1380004 to 1380561 

SeqID n° 1019 LM-2291.2 From 1275845 to 1276693 



Unknown, similar to 
branched-chain alpha-keto 
acid dehydrogenase E1 
subunit (2-oxoisovalerate 
dehydrogenase alpha 

subunit) 
Unknown, similar to 
branched-chain alpha-keto 
acid dehydrogenase E3 
subunit 
Unknown, similar to 
branched-chain fatty-acid 
kinase 
Unknown, similar to 
phosphotransbutyrylase 
DNA repair and genetic 
recombination 
Unknown, similar to 
arginine repressor 
Unknown, conserved 
hypothetical protein 
Unknown, similar to D-1- 
deoxyxylulose 5-phosphate 
synthase 
Unknown, similar to 
geranyltranstransferase 

Unknown, similar to 
exodeoxyribonuclease VII 

(large subunit) 
unknown, highly similar to 
methylenetetrahydrofolate 

dehydrogenase and 
methenyltetrahydrofolate 
cyclohydrolase 
Unknown, similar to 
transcription termination 
protein (NusB) 
unknown, similar to B. 
subtilis YqhY protein 
acetyl-CoA carboxylase 
subunit (biotin carboxylase 

subunit) 
Unknown, similar to acetyl- 
CoA carboxylase subunit 
(biotin carboxyl carrier 
subunit) 
unknown, highly similar to 
elongation factor P (EF-P) 
unknown, similar to B. 
subtilis YxkD protein 
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Seq!Dn° 1020 LM-2292.1 From 1274883 to 1275542 



SeqiDn°1021 LM-2293.1 From 1273473 to 1274678 



SeqID n° 1022 LM-2294.1 From 1273231 to 1273476 

SeqlDn° 1023 LM-2295.1 From 1272717 to 1273193 

SeqID n° 1024 LM-2296.1 From 1272286 to 1272549 

SeqID n° 1025 LM-2298.1 From 1270849 to 1272261 



SeqID n° 1026 LM-2299.1 From 1269834 to 1270433 



SeqID n° 1027 LM-230.1 From 1168390 to 1169541 

SeqID n° 1028 LM-2300.1 From 1269262 to 1269669 

SeqID n° 1029 LM-2301.1 From 1268515 to 1269108 

SeqID n° 1030 LM-2302.1 From 1267115 to 1268473 

SeqID n° 1031 LM-2303.1 From 1266040 to 1266564 



SeqID n° 1032 LM-2304.1 From 1265392 to 1266003 

SeqID n° 1033 LM-2305.1 From 1264642 to 1265388 

SeqID n° 1034 LM-2306.1 From 1263829 to 1264629 

SeqID n° 1035 LM-2307.1 From 1263187 to 1263681 

SeqID n° 1036 LM-2308.1 From 1261917 to 1263131 

SeqID n° 1037 LM-231.1 From 1168021 to 1168368 

SeqID n° 1038 LM-2310.1 From 1259917 to 1261728 



SeqID n° 1039 LM-2312.1 From 1259530 to 1259841 
SeqID n° 1040 LM-2315.1 From 1257092 to 1259449 

SeqID n° 1041 LM-2317.1 From 1255357 to 1257069 



SeqID n° 1042 LM-2318.3 From 1254722 to 1255264 



unknown, similar to 
regulator of the Fnr CRP 
family (including PrfA) 
unknown; similar to 
antibiotic resistance protein 

unknown 
unknown, weakly similar to 
8-oxo-dGTPase (mutT) 

unknown 
Unknown, similar to ATP- 
dependent RNA helicase 

(DEAD motif) 
unknown weakly similar to 
phosphoglycerate mutase 
1 

unknown 
unknown 
unknown, similar to B. 
subtilis Ydel protein 

unknown 
unknown, conserved 
hypothetical protein, similar 
to B. subtilis YsnB protein 

unknown, conserved 
hypothetical protein, similar 
to B. subtilis YsnA protein 

Unknown, similar to 

ribonuclease PH 
Unknown, similar to 
glutamate racemase 
unknown, similar to B. 
subtilis YsIB protein 
Unknown, similar to 
aspartokinase II alpha 
subunit 
unknown, similar to 
regulatory proteins 
unknown, highly similar to 
excinuclease ABC subunit 
C 

thioredoxin 
unknown, similar to MutS 

protein (MutS2) 
unknown, similar to DNA 
polymerase beta, to B. 
subtilis YshC protein 
unknown, similar to B. 
subtilis YshB protein 
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SeqID n° 


1043 


LM-2319.2 


From 347658 


to 348428 


SeqID n° 


1044 


LM-232.1 


From 1167645 


to 1167953 


SeqID n° 


1045 


LM-2320.1 


From 347219 


to 34761 1 


SeqID n° 


1046 


LM-2322.1 


From 346377 


to 347036 


SeqID n° 


1047 


LM-2324.1 


From 343221 


to 344636 


SeqID n° 


1048 


LM-2325.1 


From 342551 


to 343195 


SeqID n° 


1049 


LM-2327.1 


From 341751 


to 342554 


SeqID n° 


1050 


LM-2329.1 


From 340945 


to 341754 



SeqID n° 1051 

SeqID n° 1052 

SeqID n° 1053 

SeqID n° 1054 

SeqID n° 1055 
SeqID n° 1056 
SeqID n° 1057 



LM-2332.1 From 340278 to 340952 

LM-2333.1 From 339270 to 340055 

LM-2334.1 From 338341 to 339087 

LM-2335.2 From 337161 to 338363 

LM-2336.3 From 336541 to 337161 

LM-2338.2 From 1026871 to 1029045 

LM-2339.1 From 1026376 to 1026855 



SeqID n° 1058 LM-234.1 From 1165962 to 1 167608 

SeqID n° 1059 LM-2340.1 From 1025159 to 1026190 

SeqID n° 1060 LM-2341.1 From 1024761 to 1025129 

SeqID n° 1061 LM-2342.1 From 1023346 to 1024680 

SeqID n° 1062 LM-2344.1 From 1022535 to 1023302 

SeqID n° 1063 LM-2345.1 From 1021665 to 1022432 

SeqID n° 1064 LM-2346.1 From 1020022 to 1021383 

SeqID n° 1065 LM-2347.1 From 1019545 to 1019970 



Unknown, similar to 
unknown proteins 
unknown, similar to B. 
subtilis YjcS protein 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
phospho-beta-glucosidase 
Unknown, similar to 
thiamin-phosphate 
pyrophosphorylase (ThiE) 

Unknown, similar to 
phosphomethylpyrimidine 
kinase (ThiD) 
Unknown, similar to 
hydroxyethylthiazole kinase 
(ThiM) 

Unknown, similar to thiamin 
biosynthesis protein 
Unknown, similar to 

unknown protein 
Unknown, conserved 
hypothetical protein 
Unknown, similar to 
unknown proteins 
Unknown 
ATP-dependent protease 

unknown, similar to 
methylated-DNA-protein- 
cystein methyltransferase 
unknown, similar to ABC 
transporters, ATP-binding 
proteins (HI0664) 
unknown, similar to B. 
subtilis YkrP protein 
unknown 
unknown, similar to Na+- 
transporting ATP synthase 
subunit J 
unknown, conserved 
hypothetical protein 
unknown, conserved 
hypothetical protein 
unknown, conserved 
hypothetical protein 
unknown, similar to 
regulatory proteins (MarR 
family) 
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SeqlDn° 1066 LM-2348.1 From 1017870 to 1019438 

SeqID n° 1067 LM-2349.1 From 1016934 to 1017788 

SeqID n° 1068 LM-235.2 From 1 164245 to 1 165960 

SeqID n° 1069 LM-2350.1 From 1016030 to 1016941 



SeqID n° 1070 LM-2351.1 From 1015605 to 1016033 
SeqID n° 1071 LM-2352.1 From 1015153 to 1015608 



SeqID n° 1072 LM-2353.1 From 1014540 to 1015019 

SeqID n° 1073 LM-2355.1 From 1013475 to 1014527 

SeqID n° 1074 LM-2358.1 From 1011860 to 1013335 

SeqID n° 1075 LM-2359.1 From 1011074 to 1011826 

SeqID n° 1076 LM-236.2 From 1 163277 to 1 163996 

SeqID n° 1077 LM-23611 From 1010313 to 1011077 

SeqID n° 1078 LM-2362.1 From 1009098 to 1010117 

SeqID n° 1079 LM-2363.1 From 1008130 to 1008879 

SeqID n° 1080 LM-2364.1 From 1007669 to 1008112 

SeqID n° 1081 LM-2365.2 From 1006959 to 1007633 

SeqID n° 1082 LM-2366.2 From 1937783 to 1938415 

SeqID n° 1083 LM-2368.1 From 1938538 to 1939155 

SeqID n° 1084 LM-2369.1 From 1939168 to 1939980 

SeqID n° 1085 LM-237,1 From 1 162666 to 1 163280 

SeqID n° 1086 LM-2371.1 From 1939999 to 1942638 



unknown, similar to peptide 
chain release factor 3 (RF- 
3) 

unknown, similar to 
Streptococcus agalactiae 
CylB protein 
ATP-bindfng transport 
protein 
unknown, similar to 
antibiotic ABC transporter, 
ATP-binding protein, 
unknown 
unknown, weakly similar to 
two-component response 
regulator 
unknown, similar to 
glutathione peroxidase 
unknown, similar to 
glucanase and peptidase 
unknown, similar to efflux 

transporter 
unknown, similar to ABC 
transporter transmembrane 
component 
unknown, similar to 
transcription regulators 

unknown, similar to 
daunorubicin resistance 
ATP-binding proteins 
unknown; similar to 
branched-chain amino acid 
aminotransferase 
unknown, similar to B. 
subtilis YjcH protein 
unknown, similar to B. 
subtilis YjcF protein 
unknown, similar to ribose 
5-phosphate isomerase 
unknown, similar to 
hemolysinlll proteins, 
putative integral membrane 
protein 
unknown, similar to 
conserved hypothetical 
proteins 
unknown, similar to 
conserved hypothetical 
proteins 
Unknown, similar to 
unknown proteins 
unknown, similar to 
pyruvate phosphate 
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dikinase 



SeqlDn°1087 LM-2372.1 From 1942687 to 1943064 



SeqlDn°1088 LM-2373.1 From 1943068 to 1944051 



SeqID n° 1089 

SeqID n° 1090 

SeqID n° 1091 

SeqID n° 1092 
SeqID n° 1093 

SeqID n° 1094 

SeqID n° 1095 



LM-2375.1 From 1944145 to 1944768 

LM-2378.1 From 1944908 to 1946419 

LM-2379.1 From 1946488 to 1947363 

LM-238.1 From 1161689 to 1162696 

LM-2380.1 From 1947416 to 1947898 

LM-2381.1 From 1947914 to 1948858 

LM-2385.1 From 1948871 to 1950763 



SeqID n° 1096 LM-2386.1 From 1950883 to 1951305 



SeqID n° 1097 LM-2387.2 From 1951284 to 1952564 



SeqID n° 1098 LM-2388.2 From 1952721 to 1953149 



SeqID n° 1099 LM-239.1 From 1161220 
SeqID n° 1100 LM-2390.2 From 148354 



SeqID n° 
SeqID n° 
SeqID n° 
SeqID n° 
SeqID n° 
SeqID n° 
SeqID n° 
SeqID n° 



1101 
1102 
1103 
1104 
1105 
1106 
1107 
1108 



LM-2391.1 
LM-2392.1 
LM-2393.1 
LM-2394.1 
LM-2395.1 
LM-2397.1 
LM-2398.1 
LM-24.1 



From 147884 
From 147399 
From 146764 
From 146306 
From 145869 
From 144397 
From 144097 
From 2727304 



to 1161675 
to 150009 



to 148291 
to 147764 
to 147381 
to 146656 
to 146306 
to 144840 
to 144273 
to 2729391 



SeqID n° 1109 LM-240.1 From 1160778 to 1161230 



SeqID n° 1 1 1 0 LM-2400. 1 

SeqID n° 1111 LM-2401.1 

SeqID n° 1112 LM-2402.1 

SeqID n° 1113 LM-2404.1 



From 143366 
From 141618 
From 141337 
From 141052 



to 143722 
to 143006 
to 141705 
to 141336 



unknown, similar to 
conserved hypothetical 
proteins 
unknown, similar to 
conserved hypothetical 
proteins, putative integral 
membrane protein 
unknown; similar to alkaline 
phosphatase 
unknown, similar to 
phosphoglucomutases 
unknown, similar to 
methyltransferases 

unknown 
unknown, similar to 
dihydrofolate reductases 
unknown, similar to 
thymidylate synthase 
unknown, similar to 
putative ABC transporters 

(A TP-binding protein) 
Unknown, similar to formyl- 
tetrahydrofolate synthetase 

C-terminal part 
Unknown, similar to formyl- 
tetrahydrofolate synthetase 
N-terminal part 
unknown, similar o 
transcriptional regulators 

unknown 
Unknown, similar to ABC 
transporter oligopeptide- 
binding protein 
Unknwon 
unknwon 

unknown 

Unknwon 

Unknown 

unknown 
unknown, highly similar to 
translation elongation 

factor G 
unknown, similar to E. coli 
YjaB protein 

unknown 

Unknwon 

Unknwon 

Unknwon 
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SeqlD n° 1114 LM-2406.1 From 139960 to 140853 



SeqID n° 1115 LM-2409.1 From 138999 to 139949 



SeqlD n° 1116 LM-241.2 From 1160220 to 1160753 
SeqID n° 1 1 17 LM-241 1 .2 From 137323 to 138897 



SeqlD n° 1118 

SeqID n° 1119 

SeqID n° 1120 

SeqID n° 1121 
SeqID n° 1122 



LM-2412.2 

LM-2414.1 

LM-2415.1 

LM-2416.1 
LM-241 7.1 



From 2164106 

From 2166012 

From 2167353 

From 2168054 
From 2169312 



to 2165377 

to 2167343 

to 2167943 

to 2169097 
to 2170526 



SeqID n° 1 123 LM-241 8.1 From 2170530 to 2171900 



SeqID n° 1124 
SeqID n° 1125 



LM-242.2 
LM-2421.1 



From 1159393 
From 2172068 



to 1159830 
to 2173591 



SeqID n° 1 126 LM-2423.1 From 2173836 to 2174486 

SeqID n° 1 127 LM-2424.1 From 2174488 to 2175420 

SeqID n° 1128 LM-2426.1 From 2175438 to 2176787 

SeqID n° 1129 LM-2428.1 From 2176815 to 2177090 

SeqID n° 1 130 LM-2429.1 From 2177096 to 2177563 

SeqID n° 1131 LM-243.1 From 1158930 to 1159403 

SeqID n° 1132 LM-2431.1 From 2177571 to 2179577 

SeqID n° 1133 LM-2432.2 From 2179758 to 2181203 



SeqID n° 1 134 LM-2433.2 From 161420 to 162250 
SeqID n° 1135 LM-2439.1 From 155993 to 156805 



Unknwon, similar to 
oligopeptide ABC 
transporter, permease 
protein 
Unknwon, similar to 
oligopeptide ABC 
transporter, permease 
protein 
unknown 
Unknwon, similar to 
oligopeptide ABC transport 
system substrate-binding 

proteins 
Unknown, weakly similar to 
transcription regulators 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
transcription regulators 
Unknown, similar to lipases 

Unknown, similar to 
argininosuccinate synthase 
Unknown, similar to 
argininosuccinate lyase 
unknown 
glycine betaine transporter 
BetL 

Unknown, similar to L- 
fuculose-phosphate 
aldolase 
Unknown, similar to 1- 
phosphofructokinase 
Unknown, similar to PTS 
system galactitol-specific 
enzyme IIC component 
Unknown, similar to PTS 
system galactitol-specific 
enzyme MB component 
Unknown, similar to PTS 
system galactitol-specific 
enzyme IIA component 
unknown 
Unknown, similar to 
transcription antiterminator 
Unknown, similar to 
transcriptional regulator 

(GntR family) and to 
aminotransferase (MocR- 
like) 
unknwon 
Unknwown, conserved 
hypothetical protein 
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SeqlDn°1136 LM-244.1 From 1158425 to 1158844 

SeqlDn°1137 LM-2441.1 From 153608 to 155947 

SeqlDn°1138 LM-2442.1 From 152648 to 153328 

SeqlDn°1139 LM-2444.1 From 151839 to 152645 



SeqlDn°1140 LM-2446.1 From 151186 to 151890 



SeqlDn°1141 LM-2447.2 From 150232 to 151173 



SeqlDn°1142 LM-2448.2 From 2635167 to 2637920 

SeqID n° 1143 LM-245.1 From 1158020 to 1158388 

SeqlDn°1144 LM-2450.1 From 2637964 to 2639562 

SeqIDn°1145 LM-2452.1 From 2639930 to 2640466 



SeqlDn°1146 LM-2453.1 From 2640832 to 2642502 

SeqlDn°1147 LM-2455.1 From 2642499 to 2642948 

SeqID n° 1148 LM-2456.1 From 2643026 to 2643679 

SeqlDn°1149 LM-2457.1 From 2643973 to 2645295 



SeqlDn°1150 LM-2458.1 From 2645310 

SeqlDn°1151 LM-2459.1 From 2646686 

SeqID n° 1 1 52 LM-246. 1 From 1 157592 

SeqID n° 1 153 LM-2460.1 From 2647164 



SeqID n° 1154 
SeqID n° 1155 

SeqID n° 1156 

SeqID n° 1157 

SeqID n° 1158 
SeqID n° 1159 

SeqID n° 1160 
SeqID n° 1161 

SeqID n° 1162 
SeqID n° 1 163 



LM-246 1.1 
LM-2462.1 

LM-2463.1 

LM-2464.1 

LM-2465.1 
LM-2467.2 

LM-2469.1 
LM-247.3 

LM-2470.1 
LM-2473.1 



From 2648960 
From 2649598 

From 2650231 

From 2650758 

From 2651853 
From 2652174 

From 331543 
From 1156384 

From 331063 
From 329923 



to 2646146 
to 2647009 
to 1 158008 
to 2648825 



to 2649574 
to 2650230 

to 2650755 

to 2651756 

to 2652125 
to 2653085 

to 332688 
to 1157241 

to 331446 
to 330999 



unknown 
Unknwon, similar to ATP 
dependent helicase 
Unknwon 
Unknwon, similar to high- 
affinity zinc ABC 
transporter (membrane 

protein) 
Unknwon, similar to high- 
affinity zinc ABC 
transporter (ATP-binding 
protein) 
Unknown, similar to a 
probable high-affinity zinc 
ABC transporter (Zn(ll)- 
binding lipoprotein) 
autolysin, amidase 
unknown 
unknown, highly similar to 
CTP synthases 
Unknown, similar to B. 
subtiiis RNA polymerase 

delta subunit 
arginyl tRNA synthetase 
Unknwon 
Unknwon, conserved 
hypothetical protein 
Unknown, conserved 
hypothetical protein 
Unknwon 
Unknown 
unknown 
Unknwon, similar to 
dipeptide ABC transporter 
(dipeptide-binding protein) 
Unknwon 
Unknwon, similar to 
nicotinamidase 
Unknwon, similar to Chain 
A, Dihydrofolate Reductase 
Unknown, similar to zinc- 
binding dehydrogenase 
Unknwon 
Unknwon, similar to cation 
transport protein 
Unknown 
unknown, similar to 
methylases 
Unknown 
Unknown, similar to low 
specificity L-allo-threonine 
aldolase 
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SeqID n° 1 164 LM-2474.1 From 328701 to 329966 

SeqID n° 1 165 LM-2475.1 From 327941 to 328495 

SeqID n° 1 166 LM-2476.1 From 327621 to 327905 

SeqID n° 1 167 LM-2477.1 From 327113 to 327454 



SeqID n° 1 168 LM-2478.1 From 325729 to 327120 



SeqID n° 1 169 LM-2480. 1 From 325422 to 325712 



SeqID n° 1 170 LM-2482.1 From 324104 to 325417 



SeqID n° 1 171 LM-2483.1 From 322134 to 324005 



SeqID n° 1172 LM-2485.1 From 321490 to 322128 
SeqID n° 1 173 LM-2486.1 From 320542 to 321279 



SeqID n° 1 174 LM-2487.1 From 319560 to 320423 



SeqID n° 1 175 LM-2488.2 From 319049 to 319528 



SeqID n° 1176 LM-2489.2 From 2118812 to2119786 



SeqID n° 1 177 LM-249.2 From 2788504 to 2790243 



SeqID n° 1178 LM-2490.1 From 2117304 to 2118671 



SeqID n° 1179 LM-2491.1 From 2116216 to2117307 



SeqlDn°1180 LM-2493.1 From2115381 to 2116193 



Unknown 
Unknown, putaive 
secreted, lysin rich protein 

Unknown 
Unknown, similar to PTS 
beta-glucoside-specific 
enzyme IIA component 
Unknown, similar to 
phospho-beta-glucosidase 
and phospho-beta- 
galactosidase 
Unknown, similar to PTS 
beta-glucoside-specific 
enzyme IIB component 
Unknown, similar to PTS 
beta-glucoside-specific 
enzyme IPC component 
Unknown, similar to 
transcriptional 
antiterminator (BgIG family) 

Unknown 
Unknown, similar to FMN- 
containing NADPH-linked 
nitro/flavin reductase 
Unknown, similar to 
transcription regulator 

LysR-gltR family 
Unknown, conserved 
hypothetical protein, highly 
similar to B. subtilis YydA 
proteinYyd 
Unknown, similar to 
phospho-N- 
acetylmuramoyl- 
pentapeptide transferase 
unknown, highly similar to 
ABC transporter (ATP- 
binding protein) required 
for expression of 
cytochrome BD 
Unknown, similar to UDP- 
N-acetylmuramoylalanine 
D-glutamate ligase 
Unknown, similar to 
peptidoglycan synthesis 
enzymes, putative 

phospho-N- 
acetylmuramoyl- 
pentapeptide-transferase 
Unknown, similar to cell- 
division initiation protein 
divlB 
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SeqlDn°1181 LM-2495.1 From 2113734 to 2115014 

SeqID n° 1182 LM-2496.1 From 2112492 to 2113667 

SeqlDn°1183 LM-2497.1 From2111684 to 2112373 

SeqlDn°1184 LM-2499.1 From2111222 to 2111680 

SeqID n° 1 185 LM-25.1 From 2729457 to 2729927 

SeqID n° 1186 LM-2500.1 From2110909 to2111199 

SeqID n° 1187 LM-2501.1 From2110009 to 2110785 

SeqID n° 1188 LM-2506.1 From 2104901 to 2106001 

SeqID n° 1 189 LM-2507.1 From 2104059 to 2104904 

SeqID n° 1 190 LM-2508.2 From 2102608 to 2104062 

SeqID n° 1 191 LM-2509.2 From 2101371 to 2102477 

SeqID n° 1 192 LM-251.1 From 2790243 to 2791967 

SeqID n° 1 193 LM-2510.2 From 2034693 to 2035517 

SeqID n° 1194 LM-2511.1 From 2035507 to 2036505 

SeqID n° 1195 LM-2512.1 From 2036521 to 2037141 

SeqID n° 1 196 LM-2514.2 From 2037141 to 2037956 

SeqID n° 1 197 LM-2515.2 From 2037953 to 2038840 

SeqID n° 1 198 LM-2517.1 From 2039441 to 2039998 

SeqID n° 1 199 LM-2518.1 From 2040274 to 2040933 

SeqID n° 1200 LM-252.1 From 2791967 to 2792980 

SeqID n° 1201 LM-2520.1 From 2040911 to 20421 10 



unknown, highly similar to 
cell-division protein FtsA 
unknown, highly similar to 
cell-division initiation 

protein FtsZ 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
ribosomal protein S7 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
quinolinate synthetase 
Unknown, similar to 
nicotinate-nucleotide 
pyrophosphorylase 
Unknown, similar to L- 
aspartate oxidase 
Unknown, similar to a NifS- 
like protein required for 
NAD biosynthesis 
unknown, highly similar to 
ABC transporter required 
for expression of 
cytochrome BD 
Unknown, similar to 
ferrichrome ABC 
transporter (ATP-binding 
protein) 
Unknown, similar to 

oxidoreductases 
Unknown, similar to 
transcription regulators 
(TetR family) 
Unknown, similar to 
unknown proteins 
Unknown, similar to ABC 
transporter, ATP-binding 
protein 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
unknown, highly similar to 
cytochrome D ubiquinol 

oxidase subunit II 
Unknown, similar to toxic 
ion resistance proteins 
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SeqID n° 1202 LM-2521.1 From 2042157 to 2042900 
SeqID n° 1203 LM-2522.1 From 2042913 to 2043521 

SeqID n° 1204 LM-2523.1 From 2043540 to 2044457 

SeqID n° 1205 LM-2524.1 From 2044481 to 2045749 
SeqID n° 1206 LM-2527.1 From 2046046 to 2046489 

SeqID n° 1207 LM-2528.1 From 2046507 to 2047256 

SeqID n° 1208 LM-2529.1 From 2047444 to 2048514 

SeqID n° 1209 LM-2530.1 From 2048623 to 2049414 
SeqID n° 1210 LM-2531.1 From 2049419 to 2050339 
SeqID n° 1211 LM-2532.4 From 2050639 to 20521 14 

SeqID n° 1212 LM-2533.2 From 1731955 to 1732893 
SeqID n° 1213 LM-2536.1 From 1732926 to 1734779 

SeqID n° 1214 LM-2538.1 From 1734776 to 1735948 
SeqID n° 1215 LM-2539.1 From 1735941 to 1737065 

SeqID n° 1216 LM-254.1 From 2792967 to 2794373 

SeqID n° 1217 LM-2540.2 From 1737087 to 1739384 

SeqID n° 1218 LM-2542.2 From 2088797 to 2091454 
SeqID n° 1219 LM-2543.1 From 2087489 to 2088793 
SeqID n° 1220 LM-2544.1 From 2086817 to 2087428 



unknown, similar to 
creatinine amidohydrolases 
Unknown, similar to 2-keto- 
3-deoxygIuconate-6- 
phosphate aldolase 
Unknown, similar to 
putative 
phosphotriesterase related 
proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to PTS 
system enzyme II A 

component 
Unknown, similar to 
transcription regulators, 
(GntR family) 
Unknown, similar to E. coli 
DNA-damage-inducibile 
protein dinP 
Unknown, similar to 

oxidoreductase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
glucose-6-phosphate 1- 
dehydrogenase 
unknown, similar to 
menaquinone biosynthesis 
proteins 
unknown, similar to 5- 
methyltetrahydrofolate- 
homocysteine 
methyltransferase (metH) 

unknown; similar to 
cystathionine beta-lyase 

unknown, similar to 
cystathionine gamma- 
synthase 
unknown, highly similar to 
cytochrome D ubiquinol 
oxidase subunit I 
unknown, similar to 
cobalamin-independent 
methionine synthase 
Unknown, similar to 
putative sugar hydrolases 
Unknown, similar to 
unknown proteins 
unknown, similar to 
unknown proteins 
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SeqID n c 


. 1221 


LM-2545.1 


From 2085021 


to 2086760 


unknown, similar to two- 
component sensor histidine 
kinase 


SeqID n c 


' 1222 


LM-2546.1 


From 2083537 


to 2085021 


unknown, similar to two- 
component response 


SeqID n c 


' 1223 








regulator 


LM-2547.1 


From 2082495 


to 2083424 


Unknown, similar to 
putative transport system 


SeqID n° 










integral membrane protein 


1224 


LM-2 548.1 


From 2081505 


to 2082476 


Unknown, similar to ABC 
transporter, permease 


SeqID n° 










protein 


1225 


LM-2549.1 


From 2080027 


to 2081484 


Unknown, similar to 
putative sugar-binding 


SeqID n° 


1226 








lipoproteins 


LM-255.1 


From 2794755 


to 2795225 


Unknown, conserved 


SeqID n° 


1227 








hypothetical proteins 


LM-2553.1 


From 2078132 


to 2079829 


Unknown, similar to alpha- 
acetolactate synthase 


SeqID n° 










protein, AIsS 


1228 


LM-2554.3 


From 2077019 


to 2078014 


Unknown, similar to 
oxidoreductase 


SeqID n° 


1229 


LM-2557.2 


From 634479 


to 635279 


Unknown, similar to 
transport proteins 


SeqID n° 










(formate?) 


1230 


LM-2559.2 


From 633709 


to 634251 


Unknown 


SeqID n° 


1231 


LM-2560.1 


From 63281 1 


to 633581 


Unknown, similar to 
unknown membrane 


SeqID n° 


1232 








proteins 


LM-2561.1 


From 631045 


to 63281 1 


Unknown, similar to a 
fusion of two types of 
conserved hypothetical 
proteinconserved 
hypothetical 


SeqID n° 


1233 


LM-2562.1 


From 630629 


to 631042 


Unknown 


SeqID n° 


1234 


LM-2563.1 


From 629088 


to 630491 


Unknown, similar to DNA 


SeqID n° 










photolyase 


1235 


LM-2565.1 


From 626506 


to 628971 


Unknown, putative 


SeqID n° 










secreted protein 


1236 


LM-2566.1 


From 625463 


to 626488 


Unknown 


SeqID n° 


1237 


LM-2567.2 


From 624682 


to 625395 


Unknown, putative 


SeqID n° 










secreted protein 


1238 


LM-2568.3 


From 702088 


to 702954 


Unknown, conserved 


SeqID n° 


1239 








hypothetical proteins 


LM-2570.1 


From 701247 


to 702062 


unknown, highly similar to 
phosphomethylpyrimidine 
kinase thiD 


SeqID n° 1240 


LM-2571.1 


From 700876 


to 701184 


Unknown, similar to 












unknown proteins 


SeqID n° 


1241 


LM-2573.1 


From 700537 


to 700821 


unknown, similar to 
transposases 
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SeqID n c 


1 1242 


LM-2574.1 


From 69941 0 


to 700306 


SeqID n c 


' 1243 


LM-2575.1 


From 698785 


to 699420 


SeqID n c 
SeqID n c 


' 1244 
' 1245 


LM-2577.1 
LM-2578.1 


From 698317 
From 697634 


to 698733 
to 698197 


SeqID n c 


' 1246 


LM-2579.1 


From 696883 


to 697590 


oeqiL> n 






rrom 2795420 


to 2796997 


oeqiLJ n 
SeqID n° 


A OA Q 

1249 


LM-Zool .1 

LM-2583.1 


hrom 695496 
From 694973 


to 696416 
to 695509 


SeqID n° 


1250 


LM-2584.1 


From 694317 


to 694964 


oeqiu n 




I tn OCQ7 -\ 


hrom byi do1 


to 694271 


SeqID n° 


1252 


LM-2589.1 


From 690897 


to 691538 


SeqID n° 


1253 


LM-259.1 


From 2797039 


to 2797758 


SeqID n° 


1254 


LM-2590.1 


From 689923 


to 690873 


SeqID n° 
SeqID n° 


1255 
1256 


LM-2592.1 
LM-2593.1 


From 689541 
From 688619 


to 689825 
to 689458 


SeqID n° 


1257 


LM-2594.2 


From 687138 


to 688529 


oeqiLJ n 


IZDO 


I 0C07 <3 


rrom n io/o9 


to 1 1 1 5630 


SeqID n° 


1259 


LM-2598.3 


From 1115647 


to 1116513 


SeqID n° 


1260 


LM-2599.1 


From 1116532 


to 1117092 


SeqID n° 
SeqID n° 


1261 
1262 


LM-26.1 
LM-260.1 


From 2729958 
From 2797838 


to 2730371 
to 2798572 


SeqID n° 


1263 


LM-2600.1 


From 1117093 


to 1118079 


SeqID n° 


1264 


LM-2601.1 


From 1118082 


to 1118912 


SeqID n° 


1265 


LM-2602.1 


From 1118992 


to 1121022 



Unknown, similar to 
transcription regulator (Rgg 
type) 

Unknown, conserved 
hypothetical protein 

Unknown 
Unknown, conserved 
hypothetical protein 
Unknown, similar to 
phosphoprotein 
phosphatases 
Unknown, similar to 
aceiate-CoA ligase 

Unknown 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
transcription regulator 
Unknown, conserved 
membrane protein 
Unknown, similar to 
transcription regulators 
Unknown, similar to 
giucosamine-6-phosphate 
isomerase 
Unknown, similar to 
membrane proteins 

Unknown 
Unknown, similar to 
unknown proteins 
Unknown, similar to amino 
acid transporter 
unknown, similar to B. 
subtilis minor teichoic acids 
biosynthesis protein GgaB 

Unknown, similar to 
glucose-1-phosphate 
thymidyl transferase 
Unknown, similar to dTDP- 
sugar epimerase 
ribosomal protein S12 
Unknown, similar to merR- 
family transcriptional 
regulator 
Unknown, similar to dTDP- 
D-glucose 4,6-dehydratase 
unknown, similar to DTDP- 

L-rhamnose synthetase 
unknown, similar to teichoic 
acid biosynthesis protein B 



WO 01/77335 



PCT/FR01/01118 



105 



SeqlDn°1266 LM-2603.1 From 1121098 to 1121808 



SeqID n° 


1267 


LM-2604.1 


From 1121805 


to 1 122830 


SeqID n° 


1268 


LM-2606.1 


From 1122918 


to 1124078 


SeqID n° 


1269 


LM-2607.1 


From 1 124079 


to 1124462 


SeqID n° 


1270 


LM-2608.1 


From 1124484 


to 1125467 


SeqID n° 


1271 


LM-2609.1 


From 1125482 


to 1 126495 


SeqID n° 1272 


LM-261.1 


From 2798626 


to 2799087 


SeqID n° 


1273 


LM-2611.1 


From 1 126704 


to 1128194 


SeqID n° 


1274 


LM-2613.1 


From 1 128206 


to 1129030 



SeqID n° 1275 LM-2614.1 From 1129043 to 1129351 
SeqID n° 1276 LM-2615.1 From 1129412 to 1129816 



SeqID 


n° 


1277 


LM-2616.1 


From 1 129945 


to 1131501 


SeqID 


n° 1278 


LM-2618.1 


From 1131558 


to 1132760 


SeqID 


n° 


1279 


LM-2619.1 


From 1133698 


to 1134117 


SeqID 


n° 


1280 


LM-262.1 


From 2799108 


to 2799551 


SeqID 


n° 


1281 


LM-2621.4 


From 1134472 


to 1136595 


SeqID 


n° 


1282 


LM-2623.2 


From 2119833 


to 2121308 


SeqID 


n° 


1283 


LM-2625.2 


From 2121483 


to 2123738 


SeqID 


n° 


1284 


LM-2626.2 


From 2123735 


to 2124097 


SeqID 


n° 


1285 


LM-2628.1 


From 2124114 


to 2125052 



unknown, similar to CDP- 
ribitol pyrophosphorylase 
unknown, similar to glucitol 

dehydrogenase 
unknown, similar to teichoic 
acid biosynthesis protein B 

precursor 
unknown, highly similar to 

glycerol-3-phosphate 
cytidylyltransf erase (get), 
CDP-glycerol 
pyrophosphorylase 
(teichoic acid biosynthesis 
protein D) 
unknown, similar to 
glycosyltransferases 
unknown, siumilarto 
glysosyltransferases 
Unknown, similar to 
unknown proteins 
unknown, conserved 
hypothetical protein, similar 
to B. subtilis YueK protein 

unknown, similar to NH(3)~ 
dependent NAD(+) 
synthetases, nitrogen 
regulatory protein 
unknown 
unknown, similar to PTS 
system, celiobiose-specific 

MB component (eel A) 
unknown, highly similar to 
GMP synthetase 
unknown, similar to 

integrases 
unknown, similar to a 
protein encoded by Tn916 
Unknown, similar to 
unknown proteins 
cadmium resistance protein 
Unknown, similar to UDP- 
N-acetylmuramoylalanyl-D- 
glutamate-2,6- 
diaminopimelate ligase 
Unknown, similar to 
penicillin-binding protein 2B 

Unknown, similar to cell- 
division protein FtsL 
Unknown, similar to 
unknown proteins 
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SeqID n° 


1286 


LM-2629.1 


From 2125065 


to 2125496 


SeqID n° 


1287 


LM-263.1 


From 2799609 


to 2800961 


SeqID n° 


1288 


LM-2631.2 


From 2125699 


to 2126946 


SeqID n 


1289 


LM-2632.1 


From 2127062 


to 2128876 


SeqID n 
SeqID n° 


1290 
1291 


Llvl-2633.1 
LM-2635.1 


From 2128806 
From 2129185 


to 2129192 
to 2130078 


SeqID n° 


1292 


LM-2637.1 


From 2130492 


to 2131028 


SeqID n° 


1293 


LM-2638.1 


From 2131158 


to 2132330 


SeqID n° 


1294 


LM-2639.1 


From 2132412 


to 2134652 


SeqID n 


1295 


LM-264. 1 


From 2800958 


to 2801407 


SeqID n 


1296 


1 R K O /"» A A A 

LM-264 1.1 


From 2134695 


to 2135735 


SeqID n° 


1297 


LM-2642.1 


From 2135754 


to 2136236 


SeqID n° 


1298 


LM-2643.1 


From 2136239 


to 2136796 


SeqID n° 


1299 


LM-2645.1 


From 2136893 


to 2137174 


SeqID n° 


1300 


LM-2647.1 


From 2137205 


to 2137654 


SeqID n° 


1301 


LM-2648.1 


From 2137702 


to 2138757 


SeqID n° 


1302 


LM-265.1 


From 2801577 


to 2802458 


SeqID n° 


1303 


LM-2650.2 


From 2138904 


to 2139809 


SeqID n° 


1304 


LM-2651.2 


From 1994751 


to 1995434 


SeqID n° 


1305 


LM-2652.1 


From 1995520 


to 1996116 


SeqID n° 


1306 


LM-2653.1 


From 1996204 


to 1996749 


SeqID n° 


1307 


LM-2654.1 


From 1996784 


to 1998037 


SeqID n° 


1308 


LM-2657.1 


From 1998168 


to 1999454 


SeqID n° 


1309 


LM-2659.1 


From 1999468 


to 2000571 



Unknown, similar to 
unknown proteins 
Unknown, conserved 
hypothetical proteins 
Unknown, similar to 
integral membrane proteins 

Unknown, similar to 
transporter binding proteins 

Unknown 
Unknown, weakly similar to 
ketopantoate reductase 
involved in thiamin 

biosynthesis 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
excinuclease ABC (subunit 
A) 

Unknown, similar to 
transcription regulators 
Unknown, weakly similar to 
proteases 
Unknown, similar to 
phosphopantetheine 
adenylyltransferase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
unknown, highly similar to 
heme A farnesyltransferase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknwon, similar to 5- 
enolpyruvylshikimate-3- 
phosphate synthase 
Unknown, similar to 
prephenate dehydrogenase 
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SeqID n° 1310 LM-266.1 From 2802543 to 2802998 

SeqID n° 131 1 LM-2660.1 From 2000592 to 2001674 

SeqID n° 1312 LM-2661.1 From 2001674 to 2002048 

SeqID n° 1313 LM-2662.1 From 2002045 to 2003142 

SeqID n° 1314 LM-2664.2 From 2003145 to 2004311 

SeqID n° 1315 LM-267.1 From 2802998 to 2803402 

SeqID n° 1316 LM-2675.4 From 1748263 to 1748709 

SeqID n° 1317 LM-2676.2 From 1356769 to 1357683 

SeqID n° 1318 LM-2677.1 From 1356327 to 1356671 

SeqID n° 1319 LM-2679.1 From 1356032 to 1356310 

SeqID n° 1 320 LM-268. 1 From 2803445 to 2804056 

SeqID n° 1321 LM-2683.1 From 1353696 to 1356035 

SeqID n° 1322 LM-2684.1 From 1353374 to 1353673 

SeqID n° 1323 LM-2685.1 From 1353097 to 1353381 

SeqID n° 1324 LM-2687.1 From 1351964 to 1353082 

SeqID n° 1325 LM-2689.1 From 1351467 to 1351934 

SeqID n° 1326 LM-269.1 From 2804070 to 2804438 

SeqID n° 1327 LM-2692.1 From 1346951 to 1351285 



Unknown, similar to 
transcription regulator 
MerR family 
Unknown, similar to 
histidinol-phosphate 
aminotransferase and 
tyrosine/phenylalanine 
aminotransferase 
Unknown, similar to 
chorismate mutase 
Unknown, similar to 3- 
dehydroquinate synthase 
Unknown, similar to 
chorismate synthase 
Unknown, similar to 
unknown proteins 
unknown, similar to 
transcription regulators 
(Fur family), PerR in B. 
subtilis 
unknown, highly similar to 
tRNA pseudouridine 55 
synthase 
unknown, highly similar to 
ribosome-binding factor A 

unknown, conserved 
hypothetical protein similar 
to B. subtilis YlxP protein 
unknown, similar to 
phosphatase 
unknown, highly similar to 
translation initiation factor 
IF-2 

unknown, conserved 
hypothetical protein, similar 
to B. subtilis YlxQ protein 

unknown, similar to B. 
subtilis YlxR protein 
unknown, highly similar to 
N utilization substance 
protein A (NusA protein) 
unknown, conserved 
hypothetical protein, similar 
to B. subtilis YlxS protein 

unknown, similar to 
transcription regulator 
(RpiR family) 
unknown, highly similar to 
DNA polymerase III (alpha 
subunit) 
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SeqID n° 1328 LM-2693.1 From 1345140 to 1346846 
SeqID n° 1329 LM-2696.2 From 1343838 to 1345100 



SeqID n° 1330 LM-2698.2 From 1342682 to 1343824 



SeqID n° 1331 LM-2699.2 From 1341879 to 1342667 



SeqID n° 1332 LM-27.1 From 2730564 to 2731961 

SeqID n° 1333 LM-270.1 From 2804522 to 2805274 
SeqID n° 1334 LM-2701.3 From 1779714 to 1780802 



SeqID n° 1335 LM-2702.1 From 1780824 to 1781126 



SeqID n° 1336 LM-2703.1 From 1781196 to 1781504 



SeqID n° 1337 LM-2705.2 From 1781661 to 1784339 



SeqID n° 1338 LM-2707.2 From 1784473 to 1785798 

SeqID n° 1339 LM-2708.1 From 1785841 to 1786614 
SeqID n° 1340 LM-2710.1 From 1786607 to 1787287 



SeqID n° 1341 LM-271 1.1 From 1787280 to 1787648 

SeqID n° 1342 LM-271 2.1 From 1787769 to 1788752 

SeqID n° 1343 LM-271 3.1 From 1788812 to 1789777 

SeqID n° 1344 LM-271 4.1 From 1789825 to 1793085 

SeqID n° 1345 LM-271 7.2 From 1793082 to 1795253 

SeqID n° 1346 LM-2718.2 From 1561978 to 1564242 



proIyl-tRNA synthetase 
unknown, conserved 
hypothetical protein similar 
to B. subtilis YluC protein 

Unknown, similar to 
deoxyxylulose 5-phosphate 
reductoisomerase 
unknown, similar to 
phosphatidate 
cytidyly [transferase (CDP- 

diglyceride synthase) 
Unknown, similar to dGTP 
triphosphohydrolase 

unknown 
unknown, similar to 
putative outer surface 
protein 
unknown, similar to 
phosphotransferase 
system (PTS) lichenan- 
specific enzyme IIA 

component 
unknown, similar to 
phosphotransferase 
system (PTS) lichenan- 
specific enzyme IIB 

component 
unknown, similar to 
transcriptional regulator 

(NifA/NtrC family) 
unknown, similar to ATP- 
dependent RNA helicases 

unknown 
ubknown, similar to ABC 
transporter, ATP-binding 
protein 
unknown, similar to 
transcriptional regulator 
(GntR family) 
unknown, similar to 
hypothetical proteins 
unknown, similar to 
transcription regulators 
(Lad family) 
unknown, some similarities 
to cellobiose- 
phosphorylase 
unknown, similar to beta- 
glucosidases 
Unknown, similar to 
protein-export membrane 
protein SecDF 
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SeqID n° 1 347 LM-272. 1 From 2805440 to 2807398 

SeqID n° 1348 LM-2720.1 From 1564343 to 1564633 

SeqID n° 1349 LM-2721.1 From 1564776 to 1565105 

SeqID n° 1350 LM-2722.1 From 1565139 to 1566278 

SeqID n° 1351 LM-2723.1 From 1566365 to 1567393 

SeqID n° 1352 LM-2725.1 From 1567397 to 1568404 

SeqID n° 1353 LM-2726.1 From 1568420 to 1569025 

SeqID n° 1354 LM-2728.1 From 1569149 to 1570084 

SeqID n° 1355 LM-2729.1 From 1570154 to 1570879 

SeqID n° 1356 LM-273.1 From 2807459 to 2810107 

SeqID n° 1357 LM-2730.1 From 1570990 to 1571838 

SeqID n° 1358 LM-2732.1 From 1571905 to 1573194 

SeqID n° 1359 LM-2735.1 From 1573354 to 1574847 

SeqID n° 1360 LM-2736.2 From 1574922 to 1575740 

SeqID n° 1361 LM-2737.2 From 623251 to 624423 

SeqID n° 1362 LM-274.1 From 2810109 to 2811791 

SeqID n° 1363 LM-2741.1 From 620805 to 623135 

SeqID n° 1364 LM-2743.3 From 618932 to 620380 

SeqID n° 1365 LM-2746.1 From 617660 to 618844 

SeqID n° 1366 LM-2747.1 From 616973 to 617629 

SeqID n° 1367 LM-275.1 From 281 1788 to 2812921 



Unknown, similar to PTS 
system, fructose-specific 
IIABC component 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to tRNA- 
guanine transglycosylase 
Tgt 

Unknown, similar to S- 
adenosylmethionine.tRNA 
ribosyltransferase- 
isomerase 
unknown, highly similar to 
Holliday junction DNA 
helicase RuvB 
unknown, highly similar to 
Holliday junction DNA 

helicase (ruvA) 
Unknown, similar to L- 
lactate dehydrogenase 
Unknown, similar to 
unknown proteins 
Unknown, weakly similar to 
sugar hydrolase 
Unknown, similar to 
prephenate dehydratase 
PheA 

Unknown, conserved GTP 
binding protein 
Unknown, similar to 

glycerol kinase 
Unknown, similar to 
glycerol uptake facilitator 

Unknown, conserved 
hypothetical membrane 
protein 
Unknown, similar to 
Sucrose phosphorylase 

Unknown, similar to 
preprotein translocase 
SecA subunit 
P60 extracellular protein, 
invasion associated protein 
lap 

Unknown, conserved 
hypothetical protein 
Unknown, weakly similar to 
carboxylesterase 
Unknown, conserved 
hypothetical protein 
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SeqID n° 


1368 


LM-2752.1 


From 615825 


to 616577 


SeqID n° 1369 
SeqID n° 1370 
SeqID n° 1371 


LM-2753.1 
LM-2754.1 
LM-2755.2 


From 615353 
From 613836 
From 612678 


to 61 5811 
to 615299 
to 613406 


SeqID n° 


1372 


LM-2758.2 


From 2343858 


to 2345150 


SeqID n° 
SeqID n° 


1373 
1374 


LM-2759.1 
LM-276.1 


From 2345260 
From 2812925 


to 2345526 
to 2813881 


SeqID n° 


1375 


LM-2760.1 


From 2345567 


to 2346088 


SeqID n° 


1376 


LM-2761.1 


From 2346252 


to 2346596 


SeqID n° 


1377 


LM-2762.1 


From 2346935 


to 2347285 


SeqID n° 


1378 


LM-2763.1 


From 2347276 


to 2348151 


SeqID n° 


1379 


LM-2764.1 


From 2348231 


to 2348539 


SeqID n° 


1380 


LM-2765.1 


From 2348604 


to 2349356 


SeqID n° 


1381 


LM-2766.1 


From 2349475 


to 2350320 


SeqID n° 


1382 


LM-2768.1 


From 2350398 


to 2351300 


SeqID n° 


1383 


LM-2769.1 


From 2351384 


to 2351746 


SeqID r>° 


1384 


LM-2770.1 


From 2351767 


to 2352615 


SeqID n° 


1385 


LM-2771.1 


From 2352616 


to 2356323 


SeqID n° 


1386 


LM-2772.3 


From 2356325 


to 2359798 


SeqID n° 1387 
SeqID n° 1388 


LM-2774.2 
LM-2776.1 


From 2097176 
From 2096075 


to 2099941 
to 2097064 


SeqID n° 


1389 


LM-2778.1 


From 2095366 


to 2096061 



Unknown, putative 
conserved membrane 
protein 
Unknown 
Unknown 
Unknown, similar to 
transcription regulator 
GntR family 
Unknown, similar to 
unknown proteins 

Unknown 
Unknown, similar to 
transcriptional regulator 
(Lacl family) 
Unknown, similar to 
unknown proteins 
Unknown, hypothetical 
CDS 

unknown, similar to 
phosphotransferase 
system (PTS) beta- 
glucoside-specific enzyme 
IIA 

Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
unknown, similar to 

unkown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to ATP- 
dependent 
deoxyribonuclease (subunit 
A) 

Unknown, similar to ATP- 
dependent 
deoxyribonuclease (subunit 
B) 

isoleucyl-tRNA synthetase 
Unknown, similar to 

diaminopimelate epimerase 
Unknown, similar to 
unknown proteins 
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oeqiu n 




t n AT 070 A 

LlvI-278.1 


From 2813949 


t f\ ^— y f**AAA 

to 2815283 


SeqID n° 


1391 


LM-2780.2 


From 2091698 


to 2094808 


SeqID n° 


1392 


LM-2781.3 


From 2504300 


to 2505157 


SeqID n° 


1393 


LM-2782.1 


From 2503931 


to 2504236 


SeqID n° 


1394 


LM-2784.1 


From 2502401 


to 2503804 


SeqID n° 


1395 


LM-2785.1 


From 2501603 


to 2502361 


SeqID n° 


1396 


LM-2788.1 


From 2500179 


to 2501150 


SeqID n° 


1397 


LM-2790.1 


From 2499169 


to 2500179 


SeqID n° 


1398 


LM-2792.1 


From 2498381 


to 2499172 


SeqID n° 


1399 


LM-2793.1 


From 2497069 


to 2498238 


SeqID n° 


1400 


LM-2794.1 


From 2495818 


to 2496993 


SeqID n° 


1401 


LM-2795.2 


From 2495309 


to 2495662 


SeqID n° 


1402 


LM-2796.1 


From 1674961 


to 1676325 


SeqID n° 1403 


LM-2797.1 


From 1674359 


to 1674964 


SeqID n° 


1404 


LM-2798.1 


From 1673368 


to 1674387 


SeqID n° 


1405 


LM-2799.1 


From 1672613 


to 1673371 


SeqID n° 


1406 


LM-28.1 


From 2731976 


to 2732455 


SeqID n° 


1407 


LM-280.1 


From 2815401 


to 2816090 



Unknown, conserved 
hypothetical protein similar 
to hypothetical hemolysin 
Unknown, similar to alpha- 
mannosidase 
Unknown, similar to 
transcription antiterminator 
Unknown, similar to B. 
subtilis YfhL protein 
unknown, highly similar to 
glutamate decarboxylases 
Unknown, similar to 
acetylesterase 
Unknown, similar to B. 
subtilis ferrichrome ABC 
transporter fhuD precursor 
(ferrichrome-binding 
protein) 

Unknown, similar to B. 
subtilis ferrichrome ABC 
transporter (permease) 
FhuG 

Unknown, similar to B. 
subtilis ferrichrome ABC 
transporter (ATP-binding 

protein) FhuC 
Unknown, similar to cell 
division proteins RodA, 
FtsW 

Unknown, similar to cell 
division proteins RodA, 
FtsW 

Unknown, conserved 
hypothetical proteins 
unknown, highly similar to 
anthranilate synthase alpha 

subunit 
unknown, highly similar to 
anthranilate synthase beta 

subunit 
unknown, highly similar to 

anthranilate 
phosphoribosyltransferase 
unknown, highly similar to 
indol-3-glycerol phosphate 
synthases 
Unknown, similar to 
spermidine/spermine N1- 
acetyl transferase 
unknown, similar to 
regulatory proteins of the 
SIR2 family 
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SeqID n° 


1408 


LM-2801.1 


From 1672008 


to 1672616 


SeqID n° 


1409 


LM-2804.1 


From 1670803 


to 1672005 


SeqID n° 


1410 


LM-2805.1 


From 1670037 


to 1670810 


SeqID n° 
SeqID n° 


1411 
1412 


LM-2806.1 
LM-2808.1 


From 1669529 
From 1667831 


to 1669966 
to 1669444 


SeqID n° 


1413 


LM-2809.1 


From 1666107 


to 1667720 


SeqID n° 


1414 


LM-2810.1 


From 1665444 


to 1666097 


SeqID n° 


1415 


LM-2811.2 


From 1664575 


to 1665405 


SeqID n° 
SeqID n° 


1416 
1417 


LM-2812.2 
LM-2813.2 


From 49585 
From 49934 


to 49878 
to 50470 


SeqID n° 
SeqID n° 


1418 
1419 


LM-2816.1 
LM-2817.1 


From 50907 
From 51775 


to 51518 
to 52389 


SeqID n° 


1420 


LM-2818.1 


From 52630 


to 53925 


SeqID n° 


1421 


LM-2819.1 


From 53944 


to 54672 


SeqID n° 
SeqID n° 


1422 
1423 


LM-282.1 
LM-2820.1 


From 2816083 
From 54839 


to 2816373 
to 56812 


SeqID n° 
SeqID n° 


1424 
1425 


1 ft m O O O O si 

LM-2822.1 
LM-2823.2 


From 56815 
From 57286 


to 57261 
to 58638 


SeqID n° 


1426 


LM-2826.1 


From 2580470 


to 2581780 


SeqID n° 


1427 


LM-2827.1 


From 2581900 


to 2583105 


SeqID n° 


1428 


LM-2829.1 


From 2583181 


to 2584065 


SeqID n° 


1429 


LM-283.1 


From 2816363 


to 2817571 


SeqID n° 


1430 


LM-2831.1 


From 2584055 


to 2584741 


SeqID n° 


1431 


LM-2832.1 


From 2585245 


to 2586108 



phosphoribosyl anthranilate 

isomerase 
unknown, highly similar to 
tryptophan synthase (beta 

subunit) 
unknown, highly similar to 
tryptophan synthase (alpha 
subunit) 
unknown 
Unknown, similar to 
putative transporters 
Unknown, similar to 
putative transporters 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
SOS ribosomal protein S6 
unknown, highly similar to 
single-strand binding 
protein (SSB) 
unknown 
Unknown, similar to 
Staphylococcus AgrB 
protein 

Unknown, similar to sensor 
histidine kinase (AgrC from 
Staphylococcus) 
Unknwon, similar to 2- 
components response 
regulator protein (AgrA 
from Staphylococcus) 
Unknown 
Unknown, highly similar to 
B. subtilis YybT protein 
50S ribosomal protein L9 
unknown, highly similar to 
replicative DNA helicases 
Unknown, similar to cell 

wall binding proteins 
peptidoglycan lytic protein 
P45 

unknown, highly similar to 
cell-division protein FtsX 
Unknown, similar to drug- 
efflux transporters 
unknown, highly similar to 
the cell-division ATP- 
binding protein FtsE 
Unknown, similar to 
conserved hypothetical 
proteins 
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SeqiD n° 1432 LM-2833.1 From 25861 14 to 2587097 



SeqID n° 1433 LM-2834.2 From 2587300 to 2589813 

SeqIDn°1434 LM-2835.2 From 1052509 to 1053429 
SeqID n° 1435 LM-2836.1 From 1051695 to 1052354 



SeqID n° 1436 LM-2838.1 From 1051038 to 1051676 

SeqID n° 1437 LM-2839.1 From 1049983 to 1051041 

SeqID n° 1438 LM-284.1 From 2817688 to 2818035 

SeqID n° 1439 LM-2840.1 From 1049273 to 1049986 

SeqID n° 1440 LM-2842.1 From 1048266 to 1049138 

SeqID n° 1441 LM-2843.1 From 1047490 to 1048185 

SeqID n° 1442 LM-2845.1 From 1046886 to 1047377 

SeqID n° 1443 LM-2846.1 From 1045869 to 1046771 

SeqID n° 1444 LM-2847.1 From 1045007 to 1045855 

SeqID n° 1445 LM-285.1 From 2818054 to 2818698 

SeqID n° 1446 LM-2851.1 From 1043821 to 1045014 

SeqID n° 1447 LM-2853.1 From 1042605 to 1043450 

SeqID n° 1448 LM-2854.1 From 1041446 to 1042561 

SeqID n° 1449 LM-2856.1 From 1040672 to 1041382 



unknown, highly similar to 
peptide chain release 
factor 2 
translocase binding subunit 
(ATPase) 
unknown 
unknown, similar to a 
bacterial K(+)-uptake 
system 
unknown, similar to two- 
component response 
regulator, in particular B. 

subtilis YvqC protein 
unknown, similar to two- 
component sensor histidine 
kinase in particular B. 
subtilis YvqE protein 

unknown 
unknown, similar to B. 
subtilis YvqF protein 
unknown, similar to B. 
subtilis YitL protein 
unknown, similar to E. coli 
copper homeostasis 

protein CutC 
unknown, similar to 
phosphotransferase 
system glucose-specific 

enzyme IIA 
unknown, highly similar to 
glycine betaine ABC 
transporters (glycine 
betaine-binding protein) 
unknown, highly similar to 

glycine betaine ABC 
transporters (permease) 
Unknown, similar to 
transaldolase 
unknown, highly similar to 

glycine betaine ABC 
transporter (ATP-binding 
protein) 
unknown, similar to 
conserved hypothetical 
proteins like to B. subtilis 
YkuT protein 
unknown, similar to N-acyl- 
L-amino acid 
amidohydrolases 
unknown, similar to 
tetrahydrodipicolinate 
succinylase 
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SeqID n° 1450 LM-2857.1 From 1039746 to 1040624 

SeqID n° 1451 LM-2858.1 From 1039297 to 1039749 

SeqID n° 1452 LM-2859.2 From 1038863 to 1039096 

SeqID n° 1453 LM-286.1 From 2818841 to 2819530 

SeqID n° 1454 LM-2860.1 From 600837 to 601 148 

SeqID n° 1455 LM-2861.1 From 601149 to 601466 

SeqID n° 1456 LM-2863.1 From 601463 to 602218 

SeqID n° 1457 LM-2864.1 From 602208 to 602930 

SeqID n° 1458 LM-2866.1 From 602909 to 603535 

SeqID n° 1459 LM-2867.1 From 603536 to 604120 

SeqID n° 1460 LM-2868.1 From 604121 to 605404 

SeqID n° 1461 LM-287.1 From 2819860 to 2821587 

SeqID n° 1462 LM-2870.1 From 605401 to 606042 

SeqID n° 1463 LM-2871 ,2 From 606039 to 607220 

SeqID n° 1464 LM-2872.2 From 1645958 to 1646413 

SeqID n° 1465 LM-2873.3 From 1646669 to 1647781 

SeqID n° 1466 LM-2875.1 From 1647821 to 1648366 

SeqID n° 1467 LM-2877.1 From 1648511 to 1649854 



SeqID n° 1468 LM-2878.2 From 1650149 to 1652500 

SeqID n° 1469 LM-288.1 From 2821641 to 2821892 
SeqID n° 1470 LM-2880.1 From 1652810 to 1653427 



unknown, similar to 
transcription regulator 

(LysR family), 
unknown, similar to B. 
subtilis YkuL protein 
unknown, similar to B. 
subtilis YkuJ protein 
Unknown, weakly similar to 
transcription regulators 
CRP/FNR family 
Unknown, similar to 
phosphorybosil-AMP- 
cyclohydrolase (Hisl2 

protein) 
Unknown, similar to 
phosphoribosyl-AMP 
cyclohydrofase (HisI1 
protein) 
unknown, highly similar to 

cyclase HisF 
unknown, highly similar to 
phosphoribosylformimino- 
5-aminoimidazoIe 
carboxamide ribotide 

isomerase 
unknown, similar to 
amidotransferases 
imidazoleglycerol- 
phosphate dehydratase 
unknown, highly similar to 
histidinol dehydrogenases 
Unknown, similar to ABC 
transporter (ATP-binding 

protein) 
Unknown, similar to ATP 
phosphoribosyltransferase 
histidyl-tRNA synthetase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
aminopeptidase 
Unknown, similar to 2-cys 

peroxiredoxin 
unknown, similar to UDP- 
N-acetyl muramate-alanine 

ligases 
Unknown, similar to DNA 
translocase 
Unknown 
Unknown, similar 
phenylalanyl-tRNA 
synthetase (beta subunit) 
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SeqID n° 


1471 


LM-2881.1 


From 1653433 


SeqID n° 


1472 


LM-2882.1 


From 1654268 


SeqID n° 


1473 


LM-2883.1 


From 1654902 


SeqID n° 


1474 


LM-2884.1 


From 1656173 


SeqID n° 


1475 


LM-2885.1 


From 1656521 


SeqID n° 


1476 


LM-2886.1 


From 1656903 


SeqID n° 


1477 


LM-2887.2 


From 1657815 


SeqID n° 


1478 


LM-2889.2 


From 1658466 


SeqID n° 
SeqID n° 


1479 
1480 


LM-289.1 
LM-2890.2 


From 2821912 
From 1307009 


SeqID n° 


1481 


LM-2892.1 


From 1306095 


SeqID n° 


1482 


LM-2894.1 


From 1305781 


SeqID n° 


1483 


LM-2895.1 


From 1305390 


SeqID n° 


1484 


LM-2898 1 


From 1304457 


SeqID n° 


1485 


LM-29.1 


From 2732589 


SeqID n° 


1486 


LM-290.1 


From 2823530 


SeqID n° 


1487 


LM-2900.1 


From 1303027 


SeqID n° 


1488 


LM-2901.1 


From 1302474 


SeqID n° 


1489 


LM-2902.2 


From 1301551 


SeqID n° 


1490 


LM-2903.2 


From 1299964 


SeqID n° 


1491 


LM-2907.2 


From 1297823 


SeqID n° 


1492 


LM-2908.2 


From 1296691 



to 1654233 Unknown, similar to 

unknown proteins 
to 1654579 Unknown, similar to 

thioredoxin 
to 1655975 Unknown, similar to 

aminopeptidase 
to 1656484 Unknown, similar to 

unknown proteins 
to 1656769 Unknown, similar to 

unknown proteins 
to 1657754 Unknown, similar to 

unknown proteins 
to 1658459 Unknown, similar to 

unknown proteins 
to 1659248 Unknown, similar to 

unknown proteins 
to 2823195 seryl-trna synthetase 
to 1307605 unknown, conserved 

hypothetical protein, similar 
to B. subtilis YneS protein 

to 1306967 Unknown, similar to 
Lactococcus lactis LacX 
protein 

to 1306092 Unknown, similar to B. 

subtilis YneQ protein 
to 1305758 Unknown, similar to B. 

subtilis YneP protein 
to 1305236 unknown, highly similar to 

B. subtilis CodY protein 
to 2733233 Unknwon, similar to 
ribulose-phosphate 3- 
epimerase 
to 2823949 Unknown, similar to B. 

subtilis stress protein YdaG 
to 1 304436 unknown, highly similar to 
ATP~dependent Clp 
protease-like proteins 
to 1303013 unknown, highly similar to 
beta-type subunit of the 
20S proteasome 
to 1302453 unknown, similar to 

integrase/recombinase 
to 1301268 unknown, similar glucose 
inhibited division protein A 
to 1299901 unknown, highly similar to 
DNA topoisomerase I TopA 
to 1297551 Unknown, similar to 

polypeptide deformylase, 
similar to B. subtilis Smf 
protein 
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SeqID n° 


1493 


LM-2909.1 


From 1295772 


to 1296557 


SeqID n° 1494 


LM-291.1 


From 2824079 


to 2824651 


SeqID n° 


1495 


LM-2910.1 


From 1294912 


to 1295775 


SeqID n° 


1496 


LM-2911.1 


From 1294360 


to 1294902 


SeqID n° 


1497 


LM-2912.2 


From 1293689 


to 1294258 


SeqID n° 


1498 


LM-2914.2 


From 1498096 


to 1500252 


SeqID n° 


1499 


LM-2915.1 


From 1500268 


to 1501227 


SeqID n° 


1500 


LM-2916.1 


From 1501415 


to 1501861 


SeqID n° 


1501 


LM-2917.1 


From 1502252 


to 1503019 


SeqID n° 


1502 


LM-2918.1 


From 1503020 


to 1503964 


SeqID n° 


1503 


LM-292.1 


From 2824648 


to 2826354 


SeqID n° 1504 
SeqID n° 1505 


LM-2921.1 
LM-2922 1 


From 1504037 

From 1^0^19 
riuiii uujj i /_ 


to 1505170 

to 1^071^ 


SeqID n° 
SeqID n° 


1506 
1507 


LM-2924 1 
LM-2925.1 


From 1507187 
From 1507804 


to 150776? 
to 1508841 


SeqID n° 


1508 


LM-2926.1 


From 1508988 


to 1510145 


SeqID n° 


1509 


LM-2928.1 


From 1510234 


to 1511262 


SeqID n° 


1510 


LM-2929.1 


From 1511375 


to 1511812 


SeqID n° 


1511 


LM-2931 .3 


From 1511858 


to 1513684 


SeqID n° 


1512 


LM-2932.2 


From 2535342 


to 2536268 


SeqID n° 


1513 


LM-2933.1 


From 2533840 


to 2535183 


SeqID n° 1514 


LM-2935.1 


From 2532352 


to 2533398 



unknown, similar to 
ribonuclease H rnh 
Unknown, similar to 

glutamine 
amidotransferase 
unknown, conserved 
hypothetical protein similar 
to B. subtilis YlqF protein 
Unknown, similar to signal 

peptidase I 
Unknown, similar to signal 
peptidase I 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
phosphate starvation 
induced protein PhoH 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
ribosomal protein L1 1 
methyltransferase 
Unknown, similar to para- 
aminobenzoate synthase 

component I 
heat shock protein DnaJ 
class I heat-shock protein 
(molecular chaperone) 
DnaK 

heat shock protein GrpE 
transcription repressor of 
class I heat-shock gene 
HrcA 

unknown, highly similar to 
coproporphyrinogen III 
oxidase 
Unknown, similar to 

oxidoreductase 
Unknown, similar to 
transcriptional regulator 

(MerR family) 
unknown, highly similar to 
GTP-binding protein LepA 
Unknown, similar to 
dipeptidases 
RNA polymerase sigma-54 
factor (sigma-L) 
Unknown, similar to B. 
subtilis CggR hypothetical 
transcriptional regulator 
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SeqID 


n° 


1515 


LM-2936.1 


From 2531310 


SeqID 


n" 


1516 


LM-2937.1 


From 2529985 


SeqID 


n° 


1517 


LM-2939.1 


From 2529184 


SeqID 


n° 


1518 


LM-294.1 


From 2826515 


SeqID 


n° 


1519 


LM-2942.1 


From 2527650 


SeqID 


n° 


1520 


LM-2944.2 


From 2526222 


SeqID 
SeqID 


n° 
n° 


1521 
1522 


LM-2945.1 
LM-2946.2 


From 2525946 
From 2525094 


SeqID 


n° 1523 


LM-2950.2 


From 316873 


SeqID 


n° 


1524 


LM-2952 1 


From 315945 


SeqID 


n° 


1525 


LM-2953 1 


From 314984 


SeqID 


n° 


1526 


LM-2954.1 


From 313659 


SeqID 


n° 


1527 


LM-2957.1 


From 311830 


SeqID 


n° 


1528 


LM-2958.1 


From 310932 


SeqID 


n° 


1529 


LM-2959.2 


From 309538 


SeqID 


n° 


1530 


LM-296.1 


From 2828236 


SeqID 


n° 


1531 


LM-2960.3 


From 2692097 


SeqID 
SeqID 
SeqID 


n° 
n° 
n° 


1532 
1533 
1534 


LM-2962.2 
LM-2964.2 
LM-2965.1 


From 2693151 
From 2693563 
From 2694164 


SeqID 


n° 


1535 


LM-2967.1 


From 2694769 


SeqID 


n° 


1536 


LM-2968.1 


From 2695476 


SeqID 


n° 1537 


LM-297.1 


From 2830284 



to 2532320 unknown, highly similar to 
glyceraldehyde 3- 
phosphate dehydrogenase 
to 2531 1 75 unknown, highly similar to 
phosphoglycerate kinase 
to 2529939 unknown, highly similar to 
triose phosphate 
isomerase 
to 2828236 Unknown, similar to ABC 
transporter, ATP-binding 
protein 

to 25291 82 unknown, highly similar to 
phosphoglycerate mutase 
to 2527514 unknown, highly similar to 

enolase 

to 25261 19 Unknown 

to 2525813 Unknown, similar to 

lipolytic enzyme 
to 318375 Unknown, similar to heat- 
shock protein htrA serine 
protease 

to 316775 Unknown, conserved 

hypothetical protein similar 
to B. subtilis YycJ protein 
to 315823 Unknown, similar to B. 

subtilis Yycl protein 
to 314981 Unknown, similar to B. 

subtilis YycH protein 
to 313662 Unknown, similar to two- 
component sensor histidine 
kinase 

to 31 1645 Unknown, similar to two- 
component response 
regulator 

to 310719 Unknown, similar to 

aminotransferase 
to 2830008 Unknown, similar to ABC 
transporter, ATP-binding 
protein 

to 2693041 unknown, highly similar to 
RNA polymerase (alpha 
subunit) 

to 2693540 ribosomal protein S1 1 
to 2693928 ribosomal protein S1 3 
to 2694382 unknown, highly similar to 

initiation factor IF-I 
to 2695416 unknown, highly similar to 

adenylate kinases 
to 2696771 unknown, highly similar to 
preprotein translocase 
subunit 

to 2830523 Unknown 
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SenID 


n° 


1538 


LM-2970 1 


From 


2696771 


to 2697211 

lw £-*VW I 1 1 


ribosomal Drotein L15 




n° 


1539 


LM-2972 1 


From 


2697463 


to 2697966 


ribosomal orotein S5 


SeqID 


n° 


1540 


LM-2973 1 


From 


2697988 


to 2698347 


ribosomal protein L18 


SeqID 


n° 


1541 


LM-2974.1 


From 


2698387 


to 2698923 


ribosomal protein L6 


SeqID 


n° 


1542 


LM-2975.1 


From 


2698954 


to 2699352 


ribosomal protein S8 


SeqID 


n° 


1543 


LM-2977.1 


From 


2699600 


to 2700139 


ribosomal protein L5 


SeqID 


n° 


1544 


LM-2979.1 


From 


2700166 


to 2700477 


ribosomal protein L24 


SeqID 


n° 


1545 


LM-298.1 


From 


2830542 


to 2831879 


Unknown, similar to D- 














alanyl-D-alanine 
















carboxypepticiase 
















(penicillin-binding protein 5) 


SeqID 


n° 


1546 


LM-2981.1 


From 


2700515 


to 2700883 


ribosomal protein L14 


SenID 


n° 


1547 


LM-2983 1 


From 


2701435 


to 2701869 


ribosomal orotein 116 


SeqID 


n° 


1548 


LM-2984 1 

L.IVI A— \* **** t m 1 


From 


2701872 


to 2702528 


ribosomal protein S3 


SeqID 


n° 


1549 


LM-2985 2 


From 


2702532 


to 2702888 


ribosomal protein L22 


SeqID 




1 \J\J\J 




t i \Ji 1 1 


970990Q 


tn 970^187 


rihncxomal nrntpin 5^1 Q 

1 IUUOUI I ICll yji UlCII 1 v_> 1 \J 




n° 


I I 




Prnm 


1 «J\70 1 UU 




i inknnwn hinhh/ Qimilx^r tn 

\A\ lUVVI 1, ! "y 1 \ \y Oil 1 1 ItCJI LU 














alutamvl-tRNA reductase 


SeqID 


n 




1 M 9QQ1 1 


rlUill 


1 ^Q91R7 

I JcJZ I Q / 


\SJ IvJv70vJv7VJ 


i inknmA/n h in h l\/ cimilar to 
ui ii\i iuwi 1 , Hiyi uy oiiiiiicii i\J 






























dpaminasps 
















(hvdroxvmethvlbilane 
















synthase) 


SeqID 


n° 


1553 


LM-2992.1 


From 


1591448 


to 1592170 


Unknown, similar to 














uroporphyrinogen III 
















cosynthase (HemD) 


SeqID 


n° 


1554 


LM-2993.1 


From 


1590477 


to 1591451 


unknown, highly similar to 














delta-aminolevulinic acid 
















dehydratases 
















(porphobilinogen synthase) 


SeqID 


n° 


1555 


LM-2995.1 


From 


1589175 


to 1590464 


unknown, highly similar to 














glutamate-1-semialdehyde 
















2 , 1 -aminotransferases 


SeqID 


n° 


1556 


LM-2997.1 


From 


1586201 


to 1588852 


valyl-tRNA synthetase 


SeqID 


n° 


1557 


LM-2999.2 


From 


1584850 


to 1586139 


unknown, similar to Folyl- 














polyglutamate synthetase 


SeqID 


n° 


1558 


LM-3.1 


From 


2712931 


to 2713365 


Unknown 


SeqID 


n° 


1559 


LM-3000.1 


From 


1583928 


to 1 584638 


Unknown, similar to B. 














subtilis late competence 
















protein ComC (type IV 
















prepilin peptidase) 


SeqID 


n° 


1560 


LM-3001.1 


From 


1583242 


to 1583916 


Unknown, similar to DNA 














repair protein RadC 


SeqID 


n° 


1561 


LM-3004.1 


From 


1581777 


to 1582790 


unknown, similar to cell- 














shape determining protein 
















MreB 


SeqID 


n° 


1562 


LM-3005.1 


From 


1580804 


to 1581691 


unkown, similar to cell- 














shape determining protein 
















MreC 


SeqID 


n° 


1563 


LM-3006.1 


From 


1580283 


to 1580801 


unknown, similar to cell- 














shape determining protein 



MreD 
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SeqID 


n° 


1564 


LM-3007.2 


From 


1579428 


to 


1580105 


SeqID 


n° 


1565 


LM-301.1 


From 


2832058 


to 


2833725 


SeqID 


n° 


1566 


LM-3010.1 


From 


1149406 


to 


1 149720 


SeqID 


n° 


1567 


LM-3011.1 


From 


1149011 


to 


1149385 


SeqID 


n° 


1568 


LM-3012.1 


From 


1147603 


to 


1 149003 


SeqID 


n° 


1569 


LM-3013.1 


From 


1146228 


to 


1147412 


SeqID 


n° 


1570 


LM-3014.1 


From 


1145941 


to 


1146231 


SeqID 


n° 


1571 


LM-3016.1 


From 


1145211 


to 


1145711 


SeqID 


n° 


1572 


LM-3017.1 


From 


1144533 


to 


1144928 


SeqID 


n° 


1573 


LM-3018.1 


From 


1142099 


to 


1 144549 


SeqID 


n° 


1574 


LM-3020.1 


From 


1139940 


to 


1 142099 


SeqID 


n° 


1575 


LM-3022.1 


From 


1138930 


to 


1139940 


SeqID 


n° 


1576 


LM-3023.1 


From 


1137997 


to 


1138914 


SeqID 


n° 


1577 


LM-3024.1 


From 


1137533 


to 


1137868 


SeqID 


n° 


1578 


LM-3025.3 


From 


1254459 


to 


1254722 


SeqID 


n° 


1579 


LM-3027.3 


From 


1253379 


to 


1254311 


SeqID 


n° 


1580 


LM-3028.1 


From 


1252667 


to 


1253341 


SeqID 


n° 


1581 


LM-3029.1 


From 


1249360 


to 


1252560 


SeqID 


n° 


1582 


LM-3030.1 


From 


1248892 


to 


1249344 


SeqID 


n° 


1583 


LM-3031.2 


From 


1245384 


to 


1248794 


SeqID 


n° 


1584 


LM-3032.2 


From 


1244669 


to 


1245370 


SeqID 


n° 


1585 


LM-3035.2 


From 


1608166 


to 


1609923 



unknown, similar to cell- 
division inhibition (septum 
placement) protein MinC 

Unknown, similar to 
acylase and diesterase 
unknown, highly similar to 

TN916 0RF23 
unknown, highly similar to 

TN916 0RF22 
unknown, highly similar to 

TN916 0RF21 
unknown, highly similar to 
TN916ORF20 
unknown, similar to 
unknown proteins 
unknown, highly similar to 

TN916 0RF18 
unknown, highly similar to 

TN916 0RF17 
unknown, highly similar to 

TN916 0RF16 
unknown, highly similar to 

TN916 0RF15 
unknown, highly similar to 
TN916 ORF14 and to L. 
monocytogenes P60 
protein 
unknown, highly similar to 
TN916 0RF13 
unknown, similar to 
cadmium efflux system 

accessory proteins 
unknown, similar to B. 
subtilis YshA protein 
unknown, similar to B. 
subtilis ribonuclease Hill 
Unknown, similar to uracil- 
DNA glycosylase 
unknown; similar to 
transporter, (to B. subtilis 
YdgH protein) 
unknown; similar to 
transcriptional regulator 
(MarR family), 
unknown, similar to 
different proteins 
unknown, similar to ABC 
transporter, ATP-binding 

proteins 
unknown, highly similar to 
pyruvate kinases 
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SeqID 


n° 


1586 


LM-3036.1 


From 


1607671 


to 


1608048 


Unknown, similar to 
unknown proteins 


SeqID 


n° 


1587 


LM-3037 1 

1— 1 V 1 \~t w \-r t m 1 


From 


1607053 


to 


1607514 


Unknown similar to 
unknown proteins 


SeqID 


n° 


1588 


LM-3038.1 


From 


1605895 


to 


1607016 


unknown, highly similar to 
citrate synthase subunit II 


SeqID 


n° 


1589 


LM-3039.1 


From 


1604613 


to 


1605875 


unknown, highly similar to 
isocitrate dehyrogenases 


SeqID 


n° 


1590 


LM-304.1 


From 


2833834 


to 


2835987 


Unknown, similar to DNA 
topoisomerase III 


SeqID 


n° 


1591 


LM-3041.1 


From 


1601820 


to 


1604447 


DNA polymerase 1 


SeqID 


n° 


1592 


LM-3042.1 


From 


1600975 


to 


1601796 


unknown, highly similar to 



formamidopyrimidine-DNA 
glycosylases 

SeqID n° 1593 LM-3043.2 From 1600356 to 1600958 Unknown, similar to 

unknown proteins 

SeqID n° 1594 LM-3044.2 From 1875574 to 1876560 Unknown, similar to FtsY of 

E. coli and SRP receptor 
aipha-subunit 

SeqID n° 1595 LM-3049.1 From 1876575 to 1880135 Unknown, similar to Smc 

protein essential for 
chromosome condensation 
and partition 

SeqID n° 1596 LM-3050.1 From 1880158 to 1880847 . Unknown, similar to 

ribonuclease III 

SeqID n° 1597 LM-3053.1 From 1881381 to 1882124 Unknown, similar to 3- 

ketoacyl-acyl carrier protein 
reductase 

SeqID n° 1598 LM-3054.1 From 1882128 to 1883069 Unknown, similar to 

malonyl CoA-acyl carrier 
protein transacylase 

SeqID n° 1599 LM-3056.1 From 1883062 to 1884075 Unknown, similar to pIsX 

protein involved in fatty 
acid/phospholipid synthesis 
SeqID n° 1600 LM-3058.2 From 1884096 to 1884665 Unknown, similar to 

unknown proteins 

SeqID n° 1601 LM-3059.2 From 817914 to 818762 Unknown, similar to 

conserved hypothetical 
proteins 



SeqID 


n° 


1602 


LM-3062.1 


From 813451 


to 817878 


Unknown 


SeqID 


n° 


1603 


LM-3065.3 


From 811813 


to 813189 


Unknown, similar to amino 














acid transporter 


SeqID 


n° 


1604 


LM-3066.2 


From 2159224 


to 2159604 


Unknown 


SeqID 


n° 


1605 


LM-3067.1 


From 2157949 


to 2159094 


unknown 


SeqID 


n° 


1606 


LM-3068.1 


From 2157383 


to 2157844 


Unknown, similar to 














unknown proteins 


SeqID 


n° 


1607 


LM-3069.1 


From 2156694 


to 2157386 


Unknown, similar to 














glycoprotease 


SeqID 


n° 


1608 


LM-307.1 


From 2836011 


to 2837783 


Unknown, similar to ATP- 



dependent DNA helicases 
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SeqID n° 


1609 


LM-3070.1 


From 2156242 


to 2156697 


Unknown, similar to 
ribosomal protein alanine 
acetyltransferase 


SeqID n° 


1610 


LM-3072.1 


From 2155223 


to 2156245 


Unknown, similar to 
glycoprotein endopeptidase 


SeqID n° 


1611 


LM-3073.1 


From 2153720 


to 2154679 


Unknown, similar to 
unknown proteins 


SeqID n° 


1612 


LM-3074.1 


From 2151744 


to 2153696 


Unknown, similar to ABC 
transporter (ATP-binding 
protein) 


SeqID n° 1613 


LM-3075.1 


From 2150786 


to 2151433 


Unknown, similar to a 












putative DNA binding 












proteins 


SeqID n° 


1614 


LM-3076.2 


From 2149868 


to 2150554 


Unknown, similar to 
unknown proteins 


SeqID n° 


1615 


LM-3077.2 


From 992978 


to 993886 


unknown, similar to 


SeqID n° 












1616 


LM-3080.1 


From 993900 


to 995126 


unknown, similar to 
oro teases 


SeqID n° 


1617 


LM-3081.1 


From 995210 


to 995767 


unknown, Listeria epitope 
LemA 


SeqID n° 


1618 


LM-3083 1 

bill \J Km* a | 


From 995790 


to 99fi704 


Ul Ir\i IUWI 1, biillllcil IO 

putative heat shock protein 
HtpX, Listeria epitope 


SeqID n° 










LemB 


1619 


LM-3084.1 


From 996739 


to 997557 


unknown, similar to B. 
subtilis YjbH protein 


SeqID n° 


1620 


LM-3085.1 


From 997795 


to 998379 


unknown, similar to B. 
subtilis YjbK protein 


SeqID n° 


1621 


LM-3086.2 


From 998396 


to 998863 


unknown 


SeqID n° 


1622 


LM-3087.2 


From 999022 


to 999690 


unknown, similar to B. 
subtilis YjbM protein 


SeqID n° 


1623 


LM-3088.1 


From 999722 


to 1000516 


unknown, similar to 
conserved hypothetical 
proteins like to B. subtilis 
YjbN protein 


SeqID n° 


1624 


LM-3089.1 


From 1000535 


to 1001425 


unknown similar tn 

Ul 1 r\J iVJVVl 1, OH IIIICII LU 

ribosomal large subunit 
pseudouridine synthetase 


SeqID n° 


1625 


LM-309.2 


From 2837944 


to 2839410 


Ul IW IUWI I, OffllilCU Lw 

inosine-monophosphate 


SeqID n° 




LM-3090.1 






dehydrogenase 


1626 


From 1001503 


to 1002291 


unknown, similar to enoyl- 
acyl-carrier protein 
reductase 


SeqID n° 


1627 


LM-3091.1 


From 1002319 


to 1003593 


DltD protein for D-alanine 
esterification of lipoteichoic 
acid and wall teichoic acid 
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SeqID n° 1628 LM-3092.1 From 1003855 to 1005039 DltB protein for D-alanine 

esterification of lipoteichoic 
acid and wall teichoic acid 



SeqID n° 1629 LM-3093.2 From 1005036 to 1006568 D-alanine-activating 

enzyme (dae), D-alanine- 
D-alanyl carrier protein 
ligase (del) 

SeqID n° 1630 LM-3095.2 From 1364031 to 1364570 unknown, similar to 5- 

formyltetrahydrofolate 
cyclo-ligase 

SeqID n° 1631 LM-3097.1 From 1364669 to 1366207 unknown similar to B. 

subtilis yqgP 

SeqID n° 1632 LM-3098.1 From 1366457 to 1367425 Unknown, similar to 

glucose kinase 

SeqID n° 1633 LM-3099.1 From 1367559 to 1368650 unknown, similar to B. 

subtilis YqgU protein 

SeqID n° 1634 LM-3100.1 From 1368668 to 1368985 Unknown, weakly similar to 

B. subtilis comG operon 
protein 7 (comGG) 

SeqID n° 1635 LM-3101.1 From 1368982 to 1369449 Unknown, similar to B. 

subtilis comG operon 
protein 6 

SeqID n° 1636 LM-3102.1 From 1369412 to 1369696 Unknown, similar to comG 

operon protein 5 (comGE) 

SeqID n° 1637 LM-3103.1 From 1369683 to 1370111 Unknown, similar to comG 

operon protein 4 (comGD) 
SeqID n° 1638 LM-3104.1 From 1370108 to 1370431 Unknown, similar to B. 

subtilis comG operon 
protein 3 

SeqID n° 1639 LM-3105.1 From 1370445 to 1371476 Unknown, similar to B. 

subtilis comG operon 
protein 2 

SeqID n° 1640 LM-3106.1 From 1371454 to 1372476 Unknown, similarto B. 

subtilis comG operon 
protein 1 

SeqID n° 1641 LM-3 107.1 From 1373015 to 1374103 Unknown, similarto 

aminomethyltransferase 
SeqID n° 1642 LM-3110.3 From 1374119 to 1375465 Unknown, similarto glycine 

dehydrogenase 
(decarboxy fating) subunit 1 

SeqID n° 1643 LM-3111.1 From 1375462 to 1376928 Unknown, similarto glycine 

dehydrogenase 
(decarboxylating) subunit 2 
SeqID n° 1644 LM-3112.1 From 1376968 to 1377348 unknown 
SeqID n° 1645 LM-3113.1 From 1377410 to 1377682 Unknown 
SeqID n° 1646 LM-3114.2 From 1377695 to 1378660 Unknown, similar to B. 

subtilis YqhQ protein 

SeqID n° 1647 LM-3115.3 From 1962698 to 1963243 unknown, similarto 

conserved hypothetical 
proteins 



WO 01/77335 



PCT/FR01/01118 



123 



SeqID 


n° 


1648 


LM-3116 2 


From 1962225 


to 1962566 


nnknn\A/n <;imil£irtn 

ui ir\i iuvvi I, oil I (HOI wj 

hypothetical proteins 


SeqID 


n° 


1649 


LM-3119.1 


From 1960496 


to 1961644 


unknown, similar to 
conserved hypothetical 
proteins 


SeqID 


n° 


1650 


LM-312.1 


From 130652 


to 131380 


Unknwon, similar to 
autolvsin* N- 
acetylmuramoyl-L-alanine 
amidase 


SeqID 


n° 


1651 


LM-3120.1 


From 1958968 


to 1960476 


unknown, similar to 
probable thermostable 
carboxypeptidases 


SeqID 


n° 


1652 


LM-3121.1 


From 1958162 


to 1958740 


unknown similar to 
xanthine 
ohosDhoribosvltransferase 


SeqID 


n° 


1653 


LM-3124.1 


From 1956848 


to 1958155 


unknown, similar to 
probable permeases 


SeqID 


n° 


1654 


LM-3126.1 


From 1955598 


to 1956656 


unknown, similar to 
chitinases 


SeqID 


n° 


1655 


LM-3127.2 


From 1955205 


to 1955474 


unknown, similar to 
ribosomal protein S14 


SeqID 


n° 


1656 


LM-3128.3 


From 1954255 


to 1955127 


unknown, similar to 5 -3 
exonuclease 


SeqID 


n° 


1657 


LM-3129.1 


From 248555 


to 24901 3 


Unknown, highly similar to 
transcription repressor of 
class III stress genes 
(CtsR) 


SeqID 


n° 


1658 


LM-313.1 


From 130249 


to 130671 


Unknwon, similar to a 
protein from Bacteriophage 
phi-105 (ORF45) 


SeqID 


n° 


1659 


LM-3130.1 


From 249026 


to 249544 


Unknown, similar to B. 
subtilis YacH protein 


SeqID 


n° 


1660 


LM-3131.1 


From 249541 


to 250563 


Unknwon, similar to 
arginine kinase 


SeqID 


n° 


1661 


LM-3135.1 


From 250592 


to 253054 


endopeptidase Clp ATP- 
binding chain C 


SeqID 


n° 


1662 


LM-3 136.2 


From 253200 


to 254573 


unknown, similar to DNA 
repair protein Sms 


SeqID 


n° 


1663 


LM-3 139.2 


From 254707 


to 255780 


Unknown, highly similar to 
B. subtilis YacL protein 


SeqID 


n° 


1664 


LM-314.1 


From 129694 


to 130230 


Unknwon, weakly similar to 
protein gp20 from 
Bacteriophage A1 18 


SeqID 


n° 


1665 


LM-3140.2 


From 255800 


to 256498 


Unknown, similar to 
nucleotidyl transferase; 
pyrophosphorylase 


SeqID 


n° 


1666 


LM-3141.2 


From 348586 


to 349065 


Unknown 


SeqID 


n° 


1667 


LM-3 142 .2 


From 349326 


to 350228 


Unknown, similar to 
transcriptional regulators 


SeqID 


n° 


1668 


LM-3143.1 


From 350392 


to 351264 


Unknown, similar to 
transcriptional regulators 


SeqID 


n° 


1669 


LM-3147.1 


From 355643 


to 356323 


Unknown 
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SeqID n° 1670 LM-3149.2 From 2602249 to 2602683 



SeqID n° 1671 LM-315.1 From 129218 to 129697 
SeqiD n° 1672 LM-3150.1 From 2601632 to 2601997 

SeqID n° 1673 LM-3151.1 From 2600408 to 2601241 



SeqID n° 1674 LM-31 52.1 From 2599548 to 2600282 



SeqID n° 1675 LM-3153.1 From 2598423 to 2599547 



SeqID n° 1676 LM-3154.1 From 2597363 to 2598415 



SeqID n° 1677 LM-31 55.1 From 2596266 to 2597324 



SeqID n° 1678 LM-31 56.1 From 2595485 to 2596090 
SeqID n° 1679 LM-31 57.1 From 2594908 to 2595543 



SeqID n° 1680 LM-31 58.1 From 2593837 to 2594523 



SeqID n° 1681 LM-3159.1 From 2592965 to 2593816 

SeqID n° 1682 LM-31 6.1 From 128628 to 129203 

SeqID n° 1683 LM-31 62.1 From 2591451 to 2592770 

SeqID n° 1684 LM-3163.2 From 2590802 to 2591458 



SeqID n° 1685 LM-31 64.2 From 2590044 to 2590607 

SeqID n° 1686 LM-3167.1 From 2020738 to 2021256 
SeqID n° 1687 LM-31 68.1 From 2021272 to 2023062 



Unknwon, similar to 
hydroxymyristoyl-(acyl 
carrier protein) 
dehydratase 
Unknwon 
Unknown, similar to single- 
strand DNA-binding protein 
Unknwon, similar to 
hypothetical cell wall 
binding protein from B. 

subtilis 
Unknown, similar to B. 
subtilis TagA protein 
involved in polyglycerol 
phosphate biosynthesis 
Unknwon, similar to B. 

subtilis O- 
succinylbenzoate-CoA 

synthase (MenC) 
Unknown, similar to B. 
subtilis TagO teichoic acid 
linkage unit synthesis 

protein 
Unknown, similar to B. 

subtilis putative 
transcriptional regulator 
LytR 

unknwon 
Unknown, similar to 
conserved hypothetical 

proteins 
Unknown, similar to B. 
subtilis two-component 
response regulator DegU 
Unknown, similar to B. 
subtilis YviA (DegV) protein 
unknown 
unknown, similar to late 
competence protein com FA 
unknown, similar to late 
competence protein 

comFC 
Unknown, similar to 
conserved hypothetical 
proteins like to B. subtilis 

YvyD protein 
Unknown, similar to similar 
to acyl-CoA hydrolase 
Unknown, similar to two- 
component sensor histidine 
kinase (ResE) 
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SeqID 


n° 


1688 


LM-3 169.1 


From 2023163 


to 2023879 


SeqID 


n° 

n° 


i ooy 
1690 


!_IVi-0 I / . I 

LM-3171.1 


Prnm lOR'ZIA 
r lUlli l/COO \H 

From 2024062 


LvJ IZOD lo 

to 2024796 


SeqID 


n D 


1691 


LM-3 172.1 


From 2024799 


to 2025395 


SeqID 


n° 


1692 


LM-3 174.1 


From 2025392 


to 2026141 


SeqID 


n° 


1693 


LM-3 177.1 


From 2026157 


to 2027467 


SeqID 


n° 


1694 


LM-3178.1 


From 2027648 


to 2028466 


Cpnin 

oeqi u 


n 




LJVl-O 1 / 


Prom OHO&ARR 

i ruin zuzo'foo 




SeqID 


n° 


1696 


LM-3 18 1.2 


From 2029698 


to 2030591 


SeqID 


n° 


1697 


LM-3 184.2 


From 635317 


to 636423 


SeqID 


n° 


1698 


LM-3 186.1 


From 636440 


to 637717 


SeqID 


n° 


1699 


LM-3188.1 


From 638165 


to 638692 


SeqID 


n° 


1700 


LM-3189.1 


From 638781 


to 639482 


SeqID 


n° 


1701 


LM-319.1 


From 127188 


to 128324 


SeqID 


n° 


1702 


LM-3 190.2 


From 639558 


to 640106 


SeqID 


n° 


1703 


LM-3191.2 


From 1537815 


to 1538501 


SeqID 


n° 


1704 


LM-3 192.1 


From 1538498 


to 1539937 


SeqID 


n° 


1705 


LM-3194.1 


From 1539979 


to 1542375 


SeqID 


n° 


1706 


LM-3195.1 


From 1542403 


to 1543041 


SeqID 


n° 


1707 


LM-3196.1 


From 1543082 


to 1543822 


SeqID 


n° 


1708 


LM-3198.1 


From 1543940 


to 1545055 



Unknown, similar to two- 
component response 
regulator (ResD) 
Unknown 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
diaminopimelate 
decarboxylase 
Unknown, similar to purine- 
nucleoside phosphorylase 
Unknown, similar to 
' phosphopentomutase 
Unknown, similar to 
integrase/recombinase 
Unknown, similar to 
homoserine O- 
acetyltransferase 
Unknown, similar to O- 
acetylhomoserine 

sulfhydrylase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
transcription regulator 
CRP/FNR family 
Unknown, similar to protein 
gp18from Bacteriophage 
A118 
Unknown, similar to 
proteins involved in biotin 

metabolism (BioY) 
Unknown, similar to two- 
component response 
regulators 
Unknown, similar to two- 
component sensor histidine 
kinase 
Unknown, similar to 
exodeoxyribonuclease V 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
putative tRNA (5- 
methylaminomethyl-2- 
thiouridylate)- 
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methyltransferase 



SeqID n° 1709 LM-32.1 From 2733230 to 2735245 
SeqID n° 1710 LM-3200.2 From 1545074 to 1546222 

SeqID n° 1711 LM-3201.2 From 1527422 to 1527904 



SeqID n° 


1712 


LM-3203.2 


From 1526748 


to 1527371 


SeqID n° 


1713 


LM-3204. 1 


From 1525997 


to 1526698 


SeqID n° 


1714 


LM-3205.1 


From 1524154 


to 1525962 


SeqID n° 


1715 


LM-3206.2 


From 1523471 


to 1523992 


SeqID n° 


1716 


LM-3208.2 


From 1522374 


to 1523474 


SeqID n° 


1717 


LM-3209.1 


From 1521533 


to 1522354 


SeqID n° 


1718 


LM-321.1 


From 126360 


to 127178 


SeqID n° 


1719 


LM-3210.1 


From 1521243 


to 1521533 


SeqID n° 1720 


LM-3211.1 


From 1520659 


to 1521225 


SeqID n° 1721 


LM-3212.1 


From 1520097 


to 1520672 


SeqID n° 
SeqID n° 


1722 
1723 


LM-3213.1 
LM-3214.1 


From 1519736 
From 1518950 


to 1520092 
to 1519681 


SeqID n° 


1724 


LM-3215.1 


From 1518245 


to 1518847 


SeqID n° 


1725 


LM-3216.1 


From 1517613 


to 1518173 


SeqID n° 


1726 


LM-3217.2 


From 1515356 


to 1517578 



SeqID n° 1727 LM-3218.2 From 1418803 to 1420095 



Unknown, similar to 
transketolase 
Unknown, similar to iron- 
sulfur cofactor synthesis 
protein 
Unknown, similar to 
transcription elongation 
factor GreA 
Unknown, similar to 
unknown proteins 
Unknown, similar to 5 - 
methylthioadenosine/S- 
adenosylhomocysteine 
nucleosidase 
Unknown, similar to 

oligopeptidase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
shikimate 5- 
dehydrogenase (AroD) 
Unknown, similar to phage 
proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 

Unknown 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
integral membrane protein 
ComEA 
Unknown, similar to B. 
subtilis ComEB protein 
Unknown, similar to 
putative integral membrane 
protein ComEC specifically 
required for DNA uptake 
but not for binding 
Unknown, similar to 
putative proteases 



WO 01/77335 



PCT/FR01/01118 



127 



SeqlDn°1728 LM-3220.1 From 1417735 to 1418685 

SeqID n° 1729 LM-3221.1 From 1416686 to 1417738 

SeqlDn° 1730 LM-3223.1 From 1415152 to 1416693 

SeqID n° 1731 LM-3226.3 From 1413646 to 1414719 

SeqID n° 1732 LM-3228.3 From 1412455 to 1413294 

SeqID n° 1733 LM-3231.1 From 1410148 to 1412421 

SeqID n° 1734 LM-3234.2 From 84331 1 to 843724 

SeqID n° 1735 LM-3235.1 From 842781 to 843287 

SeqID n° 1736 LM-3237.1 From 842313 to 842765 

SeqID n° 1737 LM-3239.1 From 841 157 to 842086 

SeqID n° 1738 LM-324.1 From 124495 to 126363 

SeqID n° 1739 LM-3241.1 From 840200 to 841072 

SeqID n° 1740 LM-3243.3 From 839520 to 840062 

SeqID n° 1 741 LM-3244.3 From 838751 to 839452 

SeqID n° 1742 LM-3246.1 From 837548 to 838621 

SeqID n° 1743 LM-3248.1 From 836745 to 837551 

SeqID n° 1744 LM-3249.1 From 835939 to 836748 

SeqID n° 1745 LM-3250.1 From 834845 to 835939 

SeqID n° 1746 LM-3251 .2 From 834287 to 834829 



Unknown, similar to sugar 
ABC transporter, permease 
protein 
Unknown, similar to 
permease proteins 
Unknown, similar to sugar 
ABC transporter, ATP- 

binding protein 
Unknown, CD4+ T cell- 
stimulating antigen, 

lipoprotein 
Unknown, similar to 
pyrroline-5-carboxylate 

reductase 
Unknown, similar to DNA 

translocase 
Unknown, similar to E. coli 
PhnB protein 
Unknown, similar to B. 
subtilis regulatory protein 
PaiA 

unknown, similar to 
transcription regulators 
unknown, similar to 

oxidoreductases 
unknown, similar to 
bacteriophage minor tail 
proteins 
unknown, similar to 
fructokinases 
unknown 
unknown, similar to 
carbonic anhydrase 
Unknown, similar to 
spermidine/putrescine- 
binding protein 
unknown, similar to 
spermidine/putrescine ABC 
transporter, permease 
protein 
unknown, similar to 
spermidine/putrescine ABC 
transporter, permease 
protein 
unknown, similar to 
spermidine/putrescine ABC 
transporter, ATP-binding 
protein 
unknown, similar to 
transcription regulator 
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SeqlD n' 


5 1747 


LM-3252.2 


From 1724612 


to 1725565 


SeqID n [ 


' 1748 


LM-3254.1 


From 1725584 


to 1726993 


SeqID n c 


' 1749 


LM-3255.1 


From 1727009 


to 1727827 


SeqID n c 


' 1750 


LM-3256.1 


From 1727824 


to 1728651 


SeqID n c 


' 1751 


LM-3258.1 


From 1728653 


to 1730395 


SeqlD rt° 
SeqID n° 


1752 
1753 


LM-326.1 
LM-3260.2 


From 124104 
From 1730392 


to 124508 
to 1731780 


SeqID n° 


1754 


LM-3261.2 


From 2308741 


to 2311464 


SeqID n° 


1755 


LM-3262.1 


From 2311461 


to 2312696 


SeqID n° 


1756 


LM-3263.1 


From 2312839 


to 2313192 


SeqID n° 


1757 


LM-3265.1 


From 2313274 


to 2314407 


SeqID n° 


1758 


LM-3266.1 


From 2314511 


to 2315878 


SeqID n° 


1759 


LM-3267.1 


From 2315948 


to 2316742 


SeqID n° 


1760 


LM-3268.1 


From 2316739 


to 2317644 


SeqID n° 


1761 


LM-3269.2 


From 2317950 


to 2320094 


SeqID n° 


1762 


LM-3274.2 


From 558526 


to 560187 


SeqID n° 


1763 


LM-3275.1 


From 558037 


to 558480 


SeqID n° 


1764 


LM-3276.1 


From 557178 


to 557930 


SeqID n° 


1765 


LM-3278.1 


From 555660 


to 556976 


SeqID n° 
SeqlD n° 


1766 
1767 


LM-328.1 
LM-3281.1 


From 123760 
From 554624 


to 124062 
to 555628 


SeqID n° 


1768 


LM-3283.2 


From 553078 


to 554493 



unknown, similar to ABC 
transporter and adhesion 
proteins 
similar to O- 
succinylbenzoic acid-CoA 
ligase 
unknown, similar to 
dihydroxynapthoic acid 
synthetase 
unknown, similar to prolyl 
aminopetidases 
unknown, similar to 2- 
succinyl-6-hydroxy-2,4- 

cyciohexadiene-1- 
carboxylate synthase / 2- 
oxoglutarate decarboxylase 
unknown 
unknown, similar to 
menaquinone-specific 
isochorismate synthase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
fumarate hydratase 
Unknown, similar to 
unknown proteins 
Unknown, similar to ABC 
transporter (ATP-binding 
protein) 
Unknown, similar to 
penicillin-binding protein 
Unknown, similar to 
putative sulfate transporter 
Unknown, similar to B. 
subtilis YybC protein 
Unknown, similar to 
transcription regulator 
Unknown, similar to 6- 
phospho-beta-glucosidase 
Unknwon 
Unknown, similar to 
transcription regulator 
Unknown, similar to 
multidrug resistance 
protein 
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SeqID n° 


1769 


LM-3284.1 


From 514004 


to 514381 


SeqID n° 


1770 


LM-3285.1 


From 514681 


to 515058 


SeqID n° 


1771 


LIVI-OZoD. I 


rrom 515391 


to 515750 


SeqID n° 1772 
SeqID n° 1773 
SeqID n° 1774 


LM-3288.1 
LM-329.1 
LM-3290.1 


From 516076 
From 123200 
From 516746 


to 516636 
to 123712 
to 518446 


SeqID n° 


1775 


LM-3292.1 


From 518597 


to 519700 


SeqID n° 
SeqID n° 
SeqID n" 


1776 
1777 
1778 


LM-3293.1 
LM-3294.2 
LM-3296.2 


From 519789 
From 520426 
From 1561538 


to 520268 
to 520791 
to 1561882 


SeqID n° 


1779 


LM-3298.1 


From 1559081 


to 1561432 


SeqID n° 


1780 


LM-3299.1 


From 1558570 


to 1559091 


SeqID n° 
SeqID n° 


1781 
1782 


LM-330.1 
LM-3301.1 


From 122798 
From 1556148 


to 123187 
to 1558364 


SeqID n° 


1783 


LM-3302.1 


From 1555680 


to 1556132 


SeqID n° 


1784 


LM-3303.2 


From 1554360 


to 1555643 



SeqID n° 1785 LM-3305.2 From 2279524 to 2279919 

SeqID n° 1786 LM-3306.1 From 2280238 to 2281206 

SeqID n° 1787 LM-3307.1 From 2281203 to 2282279 

SeqID n° 1788 LM-3309.1 From 2282297 to 2283331 

SeqID n° 1789 LM-331.1 From 122104 to 122520 

SeqID n° 1790 LM-3310.1 From 2283331 to 2284260 

SeqID n° 1791 LM-3312.2 From 2284539 to 2286215 



Unknown, putative 
secreted protein 
Unknown, putative 
secreted protein 
Unknown, putative 
secreted protein 
Unknown 
antigen A 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
conserved hypothetical 
proteins, highly similar to B. 
subtilis YloN protein 
Unknown 
Unknown 
Unknown, similar to 
unknown proteins 
Unknown, similar to single- 
stranded-DNA-specific 
exonuclease (RecJ) 
unknown, similar to 
adenine 
phosphoribosyltransferase 
antigen B 
unknown, similar to 
(p)ppGpp synthetase 
Unknown, similar to 
unknown proteins 
Unknown, similar to N- 
acetylmuramoyl-L-alanine 
amidase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
oligopeptide ABC 
transporter (ATP-binding 
protein) 
Unknown, similar to 
oligopeptide ABC 
transporter (ATP-binding 
protein) 
Unknown, similar to 
oligopeptide ABC 
transporter (permease) 
unknown, similar to Antigen 
C 

Unknown, similar to 
oligopeptide ABC 
transporter (permease) 
Unknown, similar to 
pheromone binding protein 



WO 01/77335 



PCT/FR01/01118 



130 



SeqID n 


0 1792 


LM-3315.1 


From 1528098 


to 1528727 


SeqID n' 


3 1793 


LM-3316.1 


From 1528724 


to 1529377 


SeqID n' 


' 1794 


LM-3319.1 


From 1529454 


to 1530524 


SeqID n c 


' 1795 


LM-332.1 


From 121664 


to 122092 


SeqID n° 


1796 


LM-3320.1 


From 1530727 


to 1531353 


oeqiu n 


1797 


LM-3321.1 


From 1531398 


to 1531700 


SeqID n° 


1798 


LM-3322.1 


From 1531716 




SeqID n° 
SeqID n° 
SeqID n° 


1799 
1800 
1801 


LM-3323.1 
LM-3326.1 
LM-3327.1 


From 1532129 
From 153249"^ 
From 1535439 


to 1532401 

Wj I OOO I oZ. 

to 1536134 


SeqID n° 


1802 


LM-3329.2 


From 1536143 


to 1537648 


SeqID n° 


1803 


LM-333.1 


From 121112 


to 121447 


SeqID n° 


1804 


LM-3330.2 


From 512291 


to 513268 



SeqID n° 1805 
SeqID n° 1806 
SeqID n° 1807 
SeqID n° 1808 
SeqID n° 1809 
SeqID n° 1810 



LM-3331.1 
LM-3332.1 
LM-3333.1 
LM-3334,1 
LM-3335.2 
LM-3336.3 



From 
From 
From 
From 
From 
From 



511966 
511215 
510163 
509400 
507723 
506368 



to 512208 
to 511562 
to 511212 
to 510287 
to 508733 
to 506997 



SeqID n° 181 1 LM-3337.3 From 505351 to 506223 

SeqID n° 1812 LM-3338.2 From 1487699 to 1489579 

SeqID n° 1813 LM-3339.1 From 1489599 to 1490039 

SeqID n° 1814 LM-334.1 From 120654 to 121106 

SeqID n° 1815 LM-3340.1 From 1490229 to 1491053 

SeqID n° 1816 LM-3341.1 From 1491087 to 1493153 

SeqID n° 1817 LM-3342.1 From 1493146 to 1494036 



unknown, similar to Uridine 
kinase 
Unknown, similar to O- 
methyltransferase 
Unknown, similar to 
unknown proteins 
unknown, similar to Antigen 
D 

Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown 
alanyl-tRNA synthetase 
Unknown, similar to ABC 
transporter, ATP-binding 
protein 
Unknown, similar to 

transporter 
Unknwon, similar to 
putative repressor C1 from 
lactococcal bacteriophage 
Tuc2009 
Unknown, similar to 
oxetanocin A resistance 
protein oxrB 
Unknown 
Unknown 
Unknown 
Unknown 
Unknown 
Unknown, weakly similar to 
site-specific DNA- 
methyltransferase 
Unknown 
DNA primase 
Unknown, similar to 
unknown proteins 
Unknwon, similar to protein 
gp35 from Bacteriophage 
A118 
Unknown, similar to 
unknown protein 
Unknown, similar to glycyl- 
tRNA synthetase beta 
chain 

Unknown, similar to glycyl- 
tRNA synthetase alpha 
chain 
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SeqID n 


1818 


LM-3343.1 


From 1494317 


to 1495084 


SeqID n 
SeqID n° 


1819 
1820 


LM-3344.1 
LM-3345.1 


From 1495221 
From 1495899 


to 1495850 
to 1496804 


SeqID n° 


1821 


LM-3347.1 


From 1496801 


to 1497196 


SeqID n° 


1822 


LM-3348.1 


From 1497221 


to 1497616 


SeqID n° 


1823 


LM-3349.1 


From 1497594 


to 1498079 


SeqID n° 


1824 


LM-335.1 


From 119740 


to 120435 


SeqID n° 


1825 


LM-3355.1 


From 2012582 


to 2013727 


SeqID n° 1826 


LM-3356.1 


From 2014086 


to 2014760 


SeqID n° 


1827 


LlvI-3357.1 


From 2014776 


to 2015738 


SeqID n° 


1828 


LM-3359.1 


From 2015819 


to 2016538 


SeqID n° 
SeqID n° 


1829 
1830 


1 A ft /*% *~\ /~v Jt 

LM-336. 1 
LM-3360.1 


From 119063 
From 2016858 


to 119770 
to 2018261 


SeqID n° 


1831 


LM-3361.2 


From 2018258 


to 2019265 


SeqID n° 


1832 


LM-3362.2 


From 2019420 


to 2019644 


SeqID n° 


1833 


LM-3363.3 


From 2019686 


to 2020297 


SeqID n° 1834 


LM-3364.2 


From 1867802 


to 1869460 


SeqID n° 


1835 


LM-3371.2 


From 1872724 


to 1873620 



SeqID n° 1836 LM-3373.2 From 1873764 to 1875116 

SeqID n° 1837 LM-3374.2 From 1875129 to 1875461 

SeqID n° 1838 LM-3375.2 From 1358834 to 1359103 

SeqID n° 1839 LM-3376.1 From 1359352 to 1361523 

SeqID n° 1840 LM-3377.1 From 1361564 to 1362604 

SeqID n° 1841 LM-3378.2 From 1362765 to 1363244 



Unknown, similar to B, 
subtilis RecO protein 
involved in DNA repair and 
homologous recombination 

Unknown 
Unknown, similar to GTP 
binding proteins 
Unknown, similar to 
cytidine deaminase 
Unknown, similar to 
diacylglycerol kinase 
Unknown, similar to 
unknown proteins 
Unknown, weakly similar to 
transcription regulators, 
Fnr/Crp family 
Unknown, similar to similar 
to ribosomal protein S1 like 
protein 
Unknown, similar to 

cytidylate kinase 
Unknown, similar to 

asparaginase 
Unknown, similar to 
unknown proteins 
Unknwon 
Unknown, similar to ATP- 
dependent DNA helicase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 

ferredoxin 
Unknown, similar to 

unknown protein 
Unknown, similar to 

unknown protein 
Unknown, similar to 
protein-tyrosine 
phosphatase 
Unknown, similar to signal 
recognition particle protein 
Ffh 

Unknown, similar to 
unknown proteins 
ribosomal protein S15 
polynucleotide 
phosphorylase (PNPase) 

unknown, similar to 
conserved hypothetical 

proteins 
unknown, similar to B. 
subtilis YqzC protein 
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SeqID n° 1842 LM-3379.2 From 1363272 to 1363628 

SeqID n° 1843 LM-338.1 From 117841 to 118956 

SeqID n° 1844 LM-3380.2 From 1918681 to 1919505 

SeqID n° 1845 LM-3381.1 From 1919525 to 1920436 



SeqID n° 1846 LM-3382.1 From 1920436 to 1920900 

SeqID n° 1847 LM-3384.1 From 1920982 to 1922265 

SeqID n° 1848 LM-3385.1 From 1922439 to 1923809 

SeqID n° 1849 LM-3387.1 From 1923825 to 1924757 



SeqID n° 1850 LM-3389.1 From 1924754 to 1925596 

SeqID n° 1851 LM-3390.3 From 1925600 to 1926322 

SeqID n° 1852 LM-3391 .3 From 262657 to 263262 

SeqID n° 1853 LM-3392.1 From 262064 to 262576 

SeqID n° 1854 LM-3393.1 From 261306 to 262061 

SeqID n° 1855 LM-3394.1 From 260896 to 261306 

SeqID n° 1856 LM-3398.1 From 259477 to 260892 

SeqID n° 1857 LM-340.1 From 116972 to 117805 

SeqID n° 1858 LM-3400.1 From 258856 to 259470 

SeqID n° 1859 LM-3403.3 From 256983 to 258458 

SeqID n° 1 860 LM-3404.3 From 256491 to 256964 

SeqID n° 1861 LM-3405.2 From 1862459 to 1862893 

SeqID n° 1862 LM-3406.2 From 1862893 to 1863480 



unknown, similar to B. 
subtilis YqzD protein 
Unknown, similar to lipase 
unknown, similar to 
unknown proteins 
unknown, similar to 
conserved hypothetical 
proteins 
unknown, highly similar to 
signal peptidase II 
unknown, similar to 
conserved hypothetical 
proteins 
unknown, similar to 
conserved hypothetical 
proteins 
unknown, similar to 
adhesion binding proteins 

and lipoproteins with 
multiple specificity for metal 

cations 
unknown, similar to integral 
membrane proteins, ABC 
transporter 
unknown, similar to 
probable ABC transporter, 

ATP-binding proteins 
RNA polymerase sigma-30 
factor (sigma-H) 
Unknown, similar to B. 
subtilis Yacp protein 
Unknown, similar to 
conserved hypothetical 
proteins like to B. subtilis 

YacO protein 
Unknown, highly similar to 

B. subtilis YazC protein 
cysteinyl-tRNA synthetase 

Unknwon, similar to 
transcriptional regulatory 

proteins, AraC family 
unknown, similar to serine 

Oacetyltransferase 
unknown, highly similar to 
glutamyl-tRNA synthetase 
Unknown, similar to B. 
subtilis YacN protein 
Unknown, similar to 
transcription regulator 
Unknown, weakly similar to 
Nad(P)h Oxidoreductase 
chain B 
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SeqlDn°1863 LM-3407.1 From 1863501 to 1864217 

SeqID n° 1864 LM-3408.1 From 1864248 to 1864637 
SeqlDn°1865 LM-3409.1 From 1864657 to 1865394 



SeqID n° 1866 LM-3410.1 From 1865394 to 1865912 



SeqID n° 1867 

SeqID n° 1868 

SeqID n° 1869 
SeqID n° 1870 

SeqID n° 1871 

SeqID n° 1872 

SeqID n° 1873 

SeqID n° 1874 
SeqID n° 1875 

SeqID n° 1876 

SeqID n° 1877 

SeqID n° 1878 

SeqID n° 1879 

SeqID n° 1880 



LM-3411.1 

LM-3412.1 

LM-3414.2 
LM-3415.4 

LM-3416.3 

LM-3417.3 

LM-342.1 

LM-3420.4 
LM-3421.2 

LM-3422.1 

LM-3424.1 

LM-3426.1 

LM-3427.1 

LM-3428.1 



From 1865922 

From 1866336 

From 1867427 
From 1723649 

From 1723356 

From 1722431 

From 1 15087 

From 1716858 
From 1748792 

From 1749878 

From 1751283 

From 1752413 

From 1753095 

From 1753845 



to 1866308 

to 1867070 

to 1867699 
to 1724122 

to 1723598 

to 1723339 

to 116829 

to 1717160 
to 1749733 

to 1751176 

to 1752371 

to 1752940 

to 1753841 

to 1754942 



SeqID n° 1881 LM-3429.1 From 1755143 to 1756123 

SeqID n° 1882 LM-343.1 From 113313 to 115094 

SeqID n° 1883 LM-3430.2 From 1756156 to 1756617 

SeqID n° 1884 LM-3431.2 From 2891269 to 2891652 

SeqID n° 1885 LM-3432.2 From 2890779 to 2891171 

SeqID n° 1886 LM-3433.1 From 2890274 to 2890666 

SeqID n° 1887 LM-3434.1 From 2888994 to 2890277 

SeqID n° 1888 LM-3436.1 From 2888639 to 2889001 



Unknown, similar to 
unknown proteins 
Unknown 
Unknown, similar to E. coli 
tRNA (guanine-N1) 
methyltransferase 
Unknown, similar to 
putative 16S rRNA 
processing protein RimM 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
ribosomal protein S16 
unknown, some similarity 
to hypothetical proteins 
unknown, some similarity 
to hypothetical proteins 
unknown, similar to L- 
lactate dehydrogenases 
Unknown, ABC transporter, 
ATP-binding protein 

unknown 
unknown; similar to 
glycerate dehydrogenases 
glutamate-1-semialdehyde 
aminotransferase 
unknown, similar to 
hypothetical proteins 
unknown, similar to 
hypothetical proteins 
unknown, similar to 
glucose 1 -dehydrogenase 
unknown, similar to A/G- 
specific adenine 
glycosylase 
unknown, similar to 
hypothetical proteins 
Unknwon, similar to ABC 
transporter, ATP-binding 
protein 
unknown, similar to 
deoxyuridine triphosphate 
nucleotidohydrolases 
Unknown, hypothetical 

secreted protein 
Unknown, hypothetical 

secreted protein 
Unknown, hypothetical 
secreted protein 
Unknown 
Unknown 
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SeqlDn°1889 LM-3439.2 From 2887499 to 2888215 
SeqlDn°1890 LM-3442.2 From 885440 to 886882 



SeqlDn°1891 LM-3444.1 From 886875 to 887603 

SeqID n° 1892 LM-3445.2 From 887652 to 889502 

SeqID n° 1893 LM-3447.3 From 2068977 to 2069648 

SeqlDn°1894 LM-3448.1 From 2069710 to 2070657 

SeqID n° 1895 LM-3449.1 From 2070754 to 2071170 

SeqlDn° 1896 LM-345.1 From 112386 to 113288 

SeqlDn°1897 LM-3450.1 From 2071187 to 2072182 

SeqlDn° 1898 LM-3451.1 From 2072205 to 2073269 

SeqID n° 1899 LM-3452.1 From 2073282 to 2074103 

SeqID n° 1900 LM-3453.1 From 2074084 to 2074902 

SeqID n° 1901 LM-3455.1 From 2074925 to 2075392 

SeqID n° 1902 LM-3456.2 From 2075412 to 20761 13 

SeqID n° 1 903 LM-3458.2 From 20761 15 to 2076846 

SeqID n° 1904 LM-3459.2 From 607372 to 608199 

SeqID n° 1905 LM-3460.1 From 608184 to 608480 

SeqID n° 1906 LM-3461 .1 From 608557 to 609588 

SeqID n° 1907 LM-3462.2 From 609629 to 610924 

SeqID n° 1908 LM-3463.2 From 1776257 to 1776829 

SeqID n° 1909 LM-3465.1 From 1774991 to 1775983 



GidB protein 
unknown, similar to 
Glutamine binding and 
transport protein 
unknown, sfmilar to amino 
acid ABC transporter, ATP- 
binding protein 
Unknown, similar to 

amidases 
Unknown, similar to 
deoxyribose-phosphate 
aldolase 
Unknown, similar to 
transcription repressor of 
dra/nupC/pdp operon 
DeoR 

Unknown, similar to PTS 
mannose-specific enzyme 
IIA component 
Unknown, similar to 
transcription regulator 
Unknown, similar to opine 

catabolism protein 
Unknown, weakly similar to 
glucosamine-fructose-6- 
phosphate 
aminotransferase 
Unknown, similar to 
mannose-specific enzyme 

IID component 
Unknown, similar to PTS 
mannose-specific enzyme 

IIC component 
Unknown, similar to PTS 
mannose-specific enzyme 
IIB component 
Unknown, similar to 
transcription regulator 
GntR family 
Unknown, similar to 
transcription regulator 
GntR family 
Unknown, similar histidinol 
phosphate phosphatase 
Unknown, similar to 
methyltransferase 
Unknown 
Unknown, conserved 
hypothetical protein 
unknown 
unknown, similar to cell- 
shape determining proteins 
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SeqID n' 


3 1910 


LM-3466 1 

1— IVI otuu. i 


Prr\m i "7"7QJ^ -1 A 
rl UFil I / / OO 


to 1774773 


SeqID rV 


' 1911 






to 1773410 


SeqID n c 
SeqID n c 


' 1912 
' 1913 


LM-3469.2 
LM-3473 1 


From 833117 

rl Ui 1 1 OO 1 U^iD 


to 833587 
to oo3073 


SeqID n° 


1914 


LM-3475.1 


From 830259 


to 830897 


SeqID n° 


1915 


LM-3479.2 


From 827666 


to 828016 


SeqID n° 
SeqID n° 


1916 
1917 


LM-3480.3 
LM-3481.2 


From 826752 
From 2011128 


to 827513 
to 2012438 


SeqID n° 


1918 


LM-3482.1 


From 2010089 


to 201 1 1 05 


SeqID n° 


1919 


LM-3483.2 


From 2009006 


to 2009986 


SeqID n° 


1920 


LM-3484.1 


From 2008319 


to 2008594 


SeqID n° 


1921 


LM-3485.1 


From 2007515 


to 2008084 


SeqID n° 


1922 


LM-3486.1 


From 2006689 


to 2007456 



SeqID n° 1923 LM-3487.1 From 2005953 to 2006666 
SeqID n° 1924 LM-3488.1 From 2004977 to 2005942 



SeqID n° 1925 LM-3489.2 

SeqID n° 1926 LM-349.1 

SeqID n° 1927 LM-3490.2 

SeqID n° 1928 LM-3491.1 

SeqID n° 1929 LM-3493.3 



From 20045 1 5 to 2004958 

From 110013 to 112283 

From 1029208 to 1029819 

From 1029965 to 1030756 

From 1030757 to 1032229 



unknown, similar to 
multidrug resistance 
protein, integral membrane 

protein 
unknown, highly similar to 
aminopeptidases 
Unknown 
Unknown, similar to 
putative Na+/H+ antiporter 
Unknown, weakly similar to 
GTP-pyrophosphokinase 
Unknown, similar to B. 
subtilis YqkB protein 

Unknown 
Unknown, similar to 

unknown protein 
Unknown, similar to 
NAD(P)H-dependent 
glycerol-3-phosphate 

dehydrogenase 
Unknown, similar to 
protein-tyrosine/serine 
phosphatase 
Unknown, similar to non- 
specific DNA-binding 
protein HU 
Unknown, similar to GTP 
cyciohydrolase I 
Unknown, similar to 
heptaprenyl diphosphate 
synthase component I 
Unknown, similar to 2- 
heptaprenyl-1 ,4- 
naphthoquinone 
methyltransferase 
Unknown, similar to 
heptaprenyl diphosphate 
synthase component II 
(menaquinone 
biosynthesis) 
Unknown, similar to 
nucleoside diphosphate 
kinase 

Unknown, highly similar to 
chitinase B 
unknown, similar to 
hypothetical protein 

unknown 
unknown, similar to 
PHYTOENE 
DEHYDROGENASE (EC 
1.3.-.-) (PHYTOENE 
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DESATURASE) 



SeqlDn° 1930 LM-3494.3 From 1032333 to 1032617 

SeqiDn°1931 LM-3495.3 From 1032924 to 1033190 

SeqIDn° 1932 LM-3497.4 From 1033190 to 1034908 

SeqID n° 1933 LM-3498.1 From 1035021 to 1036055 

SeqID n° 1934 LM-35.1 From 2735279 to 2735953 

SeqID n° 1935 LM-3500.1 From 1036068 to 1036928 

SeqID n° 1936 LM-3501.2 From 1036971 to 1038116 

SeqID n° 1937 LM-3504.2 From 641208 to 642308 

SeqID n° 1938 LM-3505.1 From 642414 to 642914 

SeqID n° 1939 LM-3506.1 From 642962 to 643348 

SeqID n° 1940 LM-3507.1 From 643400 to 643744 

SeqID n° 1941 LM-3508.2 From 643891 to 645231 



SeqID n° 1942 LM-351.2 From 109349 to 109720 

SeqID n° 1943 LM-3510.2 From 2145652 to 2146044 

SeqID n° 1944 LM-3511.1 From 2146089 to 2146298 

SeqID n° 1945 LM-3512.1 From 2146449 to 2147426 



SeqID n° 1946 LM-3514.2 From 2147684 to 2149312 

SeqID n° 1947 LM-3517.2 From 1888615 to 1890273 

SeqID n° 1948 LM-3518.1 From 1887913 to 1888575 

SeqID n° 1949 LM-3519.1 From 1886887 to 1887777 

SeqID n° 1950 LM-352.2 From 108624 to 109256 

SeqID n° 1951 LM-3520.2 From 1884846 to 1886894 



unknown, similar to B. 
subtilis protein YkvS 
PHOSPHOCARRIER 
PROTEIN HPR 
(HISTIDINE-CONTAINING 
PROTEIN), 
phosphotransferase 

system enzyme I 
unknown, conserved 
hypothetical protein 
Unknown, similar to 
ribulose~5-phosphate 3- 

epimerase 
unknown, similar to 3- 
hydroxyisobutyrate 
dehydrogenase (B. subtilis 
YkwC protein) 
unknown, similar to 
aminotransferases (to B. 

subtilis PatA protein) 
UNknown, similar to ceil 
surface protein 
Unknown, weakly similar to 
transcription regulator 

Unknown 
Unknown, similar to B. 
subtilis YvIA protein 
Unknown, conserved 
hypothetical membrane 
protein 
Unknwon 
Unknown 
Unknown 
Unknown, similar to 
conjugated bile acid 
hydrolase 
class I heat-shock protein 
(chaperonin) GroEL 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
phosphoglycerate 
dehydrogenase 
Unknown, similar to L- 
serine dehydratase 
Unknwon, similar to NADH 

oxidase 
Unknown, similar to ATP- 
dependent DNA helicase 
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recG 



SeqID n° 1952 

SeqID n°1953 

SeqID n° 1954 

SeqID n° 1955 

SeqID n° 1956 

SeqID n° 1957 
SeqID n° 1958 
SeqID n° 1959 



LM-3523.1 

LM-3524.1 

LM-3525.1 

LM-3527.1 

LM-3528.2 

LM-353.1 
LM-3533.1 
LM-3534.1 



From 2143109 

From 2142480 

From 2141820 

From 2141071 

From 2140046 

From 108320 
From 1053426 
From 1054319 



to 2144734 

to 2143097 

to 2142461 

to 2141814 

to 2140963 

to 108610 
to 1054277 
to 1055284 



SeqID n° 1960 LM-3535.1 From 1055393 to 1057060 

SeqID n° 1961 LM-3537.1 From 1057761 to 1058531 

SeqID n° 1962 LM-3538.3 From 1058580 to 1059608 

SeqID n° 1963 LM-3539.1 From 2286839 to 2287183 

SeqID n° 1964 LM-354.1 From 107915 to 108208 

SeqID n° 1965 LM-3541.1 From 2287518 to 2288513 

SeqID n° 1966 LM-3543.1 From 2288571 to 2288987 

SeqID n° 1967 LM-3544.1 From 2288971 to 2289423 

SeqID n° 1968 LM-3547.1 From 2289545 to 2290786 

SeqID n° 1969 LM-3549.2 From 2290989 to 2291927 

SeqID n° 1970 LM-355.1 From 107499 to 107849 

SeqID n° 1971 LM-3551.1 From 1242100 to 1244508 

SeqID n° 1972 LM-3553.2 From 1241048 to 1242100 

SeqID n° 1973 LM-3555.2 From 2305632 to 2305994 

SeqID n° 1974 LM-3556.1 From 2305180 to 2305602 



Unknown, similar to copper 
export proteins 
Unknown, similar to 

unknown protein 
Unknown, similar to 

unknown protein 
Unknown, similar to 
potassium channel subunit 
Unknown, similar to heme 
O oxygenase 
Unknwon 
unknown 
unknown, similar to B. 
subtilis LytR protein 
unknown, similar to 
conserved hypothetical 
proteins (in particular B. 
subtilis YkqC) 
unknown, similar to 
conserved hypothetical 
proteins 
unknown, similar to 
transcriptional regulator, 
Lacl family 
Unknown 
Unknwon, similar to 
transcription regulator 
tryptophanyl-tRNA 

synthetase 
Unknown, similar to 

unknown protein 
Unknown, similar to 
transcription regulator 
Unknown, similar to 3- 
oxoacyl-acyl-carrier protein 
synthase 
Unknown, similar to 3- 
oxoacyl- acyl-carrier 
protein synthase 

Unknwon 
phenylalanyl-tRNA 
synthetase beta subunit 

phenylalany-tRNA 
synthetase beta subunit 
Unknown, similar to 

unknown protein 
Unknown, similar to 
histidine triad (HIT) protein 
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SeqID n° 1 975 LM-3557. 1 From 2304267 to 2305019 



SeqID n° 1976 LM-3558.1 From 2303060 to 2304283 



SeqID n° 1977 LM-3559.1 From 2302518 to 2303021 

SeqID n° 1978 LM-356.1 From 106988 to 107377 
SeqID n° 1979 LM-3560.1 From 2301260 to 2302321 



SeqID n° 1980 LM-3561.1 From 2300334 to 2301263 

SeqID n° 1981 LM-3562.2 From 2299833 to 2300177 

SeqID n° 1982 LM-3563.2 From 1548034 to 1548456 

SeqID n° 1983 LM-3565.2 From 1548701 to 1549906 



SeqID n° 1984 LM-3566.1 From 1549920 to 1550285 



SeqID n° 1985 LM-3567.1 From 1550435 to 1550815 
SeqID n° 1986 LM-3569.1 From 1550854 to 1552629 
SeqID n° 1987 LM-357.1 From 105951 to 106862 



SeqID n° 1988 LM-3570.2 From 1552632 to 1553909 
SeqID n° 1989 LM-3571.4 From 1713234 to 1715099 

SeqID n° 1990 LM-3572.1 From 1712634 to 1713209 



SeqID n° 1991 LM-3573.2 From 1711672 to 1712637 

SeqID n° 1992 LM-3574.3 From 896003 to 897028 

SeqID n° 1993 LM-3576.1 From 895278 to 895991 

SeqID n° 1994 LM-3577.1 From 893838 to 89521 1 



SeqID n° 1995 LM-3578.1 From 892662 to 893774 
SeqID n° 1996 LM-3579.1 From 892192 to 892512 



Unknown, similar to ABC 
transporter (ATP-binding 

protein) 
Unknown, similar to ABC 
transporter (membrane 
protein) 
Unknown, similar to 
unknown protein 
unknown 
Unknown, similar to 
uroporphyrinogen III 

decarboxylase 
Unknown, similar to 
ferrochelatase 
Unknown 
Unknown, similar to 

unknown protein 
Unknown, similar to 
ammonium transporter 
NrgA 
Unknown, similar to 
nitrogen regulatory PI I 
protein 
Unknown 
aspartyl-tRNA synthetase 
Unknown, similar to PTS 
system mannose-specific, 

factor II D 
histidyl-tRNA synthetase 

unknown, similar to 
asparagine synthetase 

unknown, similar to 
conserved hypothetical 
protein 
unknown, similar to 
conserved hypothetical 
proteins 
unknown, similar to 
transcription regulator lad 
family 
unknown, similar to 
carboxylesterase 
UDP-N- 
acetylmuramoylalanyl-D- 

glutamyl-2,6- 
diaminopimelate-D-alanyl- 

D-alanyl ligase 
D-alanine-D-alanine ligase 
Unknown, similar to E. coli 
SugE protein 
(transmembrane 
chaperone) 
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SeqID n° 1997 LM-358.1 From 105123 to 105929 



SeqID n° 1998 LM-3581.1 From 891848 to 892189 



SeqID n° 1999 LM-3582.1 From 891277 to 891831 



SeqID n° 2000 LM-3583.3 From 890439 to 891 197 
SeqID n° 2001 LM-3585.2 From 1926935 to 1928425 



SeqID n° 2002 LM-3588.1 From 1928798 to 1931011 



SeqID n° 2003 LM-3589.1 From 1931026 to 1931319 



SeqID n° 2004 LM-359.1 From 104134 to 105099 



SeqID n° 2005 LM-3590.2 From 1931442 to 1932266 



SeqID n° 2006 LM-3591.3 From 1932324 to 1933025 

SeqID n° 2007 LM-3592.2 From 2487261 to 2488337 

SeqID n° 2008 LM-3593.1 From 2488379 to 2489209 

SeqID n° 2009 LM-3596. 1 From 2489274 to 2489948 



SeqID n° 201 0 LM-3597.1 From 2489945 to 2490967 



SeqID n° 201 1 LM-3599.1 From 2491552 to 2492694 



SeqID n° 2012 LM-36.1 From 2735962 to 2736402 

SeqID n° 2013 LM-360.1 From 103204 to 103836 
SeqID n° 2014 LM-3600.1 From 2492684 to 2493379 



SeqID n° 201 5 LM-3601 .3 From 2493454 to 2494329 

SeqID n° 2016 LM-3602.2 From 1435635 to 1437866 
SeqID n° 2017 LM-3603.1 From 1437943 to 1438689 



Unknown, similar to PTS 
system mannose-specific, 

factor IIC 
Unknown, similar to E. coli 
SugE protein 
(transmembrane 
chaperone) 
Unknown, similar to 
transcription regulator 
TetR/AcrR family 

Unknown 
unknown, similar to 
carboxy-terminal 
processing proteinase 
unknown, similar to heavy 
metal-transporting 

ATPases 
unknown, similar to 
conserved hypothetical 

proteins 
Unknown, similar to PTS 
system mannose-specific, 

factor IIAB 
unknown, similar to similar 
to D-alanyl-D-alanine 
carboxypeptidases 
purine nucleoside 
phosphorylase 
Unknown 
Unknown, conserved 
lipoprotein 
Unknown, similar to ABC 
transporter, permease 
protein 
Unknown, similar to ABC 
transporter (ATP-binding 

protein) 
Unknown, similar to two- 
component sensor histidine 
kinase 

Unknwon, similar to ribose 
5-phosphate epimerase 

unknown 
Unknown, similar to two- 
component response 

regulator 
Unknown, conserved 
hypothetical protein 
pyruvate formate-lyase 
pyruvate-formate lyase 
activating enzyme 
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SeqID n° 2018 LM-3604.1 From 1438724 to 1439254 
SeqID n° 2019 LM-3605.2 From 1439421 to 1440614 
SeqID n° 2020 LM-3608.2 From 1 104863 to 1 105864 



SeqID n° 2021 LM-361.1 From 102468 to 103028 
SeqID n° 2022 LM-3612.1 From 1 107975 to 1 109663 



SeqID n° 2023 LM-3613.1 From 1109705 to 1110577 

SeqID n° 2024 LM-3614.3 From 1110769 to 1113627 

SeqID n° 2025 LM-3615.2 From 1277312 to 1278025 

SeqID n° 2026 LM-3616.1 From 1278056 to 1279702 

SeqID n° 2027 LM-3617.3 From 1279721 to 1281205 

SeqID n° 2028 LM-3618.3 From 1434661 to 1435209 

SeqID n° 2029 LM-362.1 From 102155 to 102478 

SeqID n° 2030 LM-3620.2 From 1432839 to 1434644 

SeqID n° 2031 LM-3623.1 From 1430237 to 1432819 

SeqID n° 2032 LM-3624.2 From 1429765 to 1430127 

SeqID n° 2033 LM-3625.2 From 758656 to 759396 

SeqID n° 2034 LM-3626.2 From 756738 to 758543 

SeqID n° 2035 LM-3627.2 From 598020 to 598925 

SeqID n° 2036 LM-3628.2 From 598968 to 600344 



Unknown, similar to 
unknown proteins 
Unknown, similar to 
multidrug-efflux transporter 
unknown, similar to 
TEICHOIC ACID 
TRANSLOCATION ATP- 
BINDING PROTEIN TAGH 
(ABC transporter) 

unknown 
unknown, similar to 
TEICHOIC ACID 
BIOSYNTHESIS PROTEIN 
B PRECURSOR 
unknown, similar to 
putative UDP-glucose 
pyrophosphorylases 
unknown, similar to B. 
subtilis YfhO protein 
Unknown, similar to 
transcription regulator 
GntR family 
Unknown, similar to 

alpha, alpha- 
phosphotrehalase 
Unknown, similar to PTS 
system trehalose specific 
enzyme IIBC 
Unknown, similar to 
putative anti-terminator 

regulatory protein 
Unknwon, similar to ATP 
synthase epsilon chain 
DNA mismatch repair 

protein 
DNA mismatch repair 

(recognition) 
Unknown, similar to B. 
subtilis YmcA protein 
Unknown, similar to 
riboflavin kinase / FAD 

synthase 
Unknown, similar to L- 
gIutamine-D-fructose-6- 
phosphate 
amidotransferase 
Unknown, putative 
membrane protein 
Unknown, similar to NADP- 
specific glutamate 
dehydrogenase 
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SeqiD n° 2037 LM-363.1 From 100772 to 102142 

SeqID n° 2038 LM-3630.4 From 1 34782 to 1 36290 

SeqID n° 2039 LM-3631.2 From 133961 to 134710 

SeqID n° 2040 LM-3632.2 From 2292047 to 22931 74 

SeqID n° 2041 LM-3633.1 From 2293775 to 2294464 

SeqID n° 2042 LM-3635.2 From 2294555 to 22971 55 

SeqID n° 2043 LM-3636.2 From 1281321 to 1281773 

SeqID n° 2044 LM-3637.1 From 1281820 to 1282284 

SeqID n° 2045 LM-3638.1 From 1282473 to 1283393 

SeqID n° 2046 LM-3639.1 From 1283413 to 1284660 

SeqID n° 2047 LM-364.1 From 99902 to 100771 

SeqID n° 2048 LM-3640.1 From 1284644 to 1285474 

SeqID n° 2049 LM-3643.3 From 1285610 to 1286749 

SeqID n° 2050 LM-3644.4 From 1286830 to 1287225 

SeqID n° 2051 LM-3645.2 From 1765915 to 1767294 

SeqID n° 2052 LM-3647.1 From 1765499 to 1765900 



SeqID n° 2053 LM-3648.1 From 17651 17 to 1765470 
SeqID n° 2054 LM-3649.1 From 1763941 to 1764843 



SeqID n° 2055 LM-3650.1 From 1763209 to 1763751 



SeqID n° 2056 LM-3652.2 From 1762200 to 1763165 



SeqID n° 2057 LM-3656.2 From 2840286 to 2841896 



SeqID n° 2058 LM-3657.2 From 2839698 to 2840228 

SeqID n° 2059 LM-3658.1 From 1595227 to 1597149 
SeqID n° 2060 LM-3659.1 From 1597499 to 1598422 



Unknown, similar to ATP 

synthase beta chain 
Unkown, similar to inosine 
monophosphate 
dehydrogenase 
Unknwon, conserved 
hypothetical protein 
Unknown, similar to N- 
acetylmuramoyl-L-alanine 
amidase and to internalin B 

Unknown, similar to 
phosphoglyceromutase 1 

Unknown, similar to 
endopeptidase Clp ATP- 
binding chain B (CIpB) 
unknown 
unknown 
unknown 
gamma-glutamyl 
phosphate reductase 
unknwon, similar to ATP 
synthase gamma chain 
gamma-glutamyl kinase 
unknown 
unknown, similar to 
transcriptional regulator 
(phage-related) 
unknown, similar to similar 
to RNA methyltransferases 
unknown, similar to 
glutathione transferase - 
fosfomycin resistance 
protein 
unknown 
unknown, some similarities 
to methyl-accepting 
chemotaxis proteins 
unknown, similar to 
ribosomal-protein-alanine 
N-acetyltransferase 
unknown, similar to 
putative transmembrane 

proteins 
Unknown, similar to ABC 
transporter (ATP-binding 
protein) 
Unknown, similar to 
unknown protein 
threonyl-tRNA synthetase 
primosome component 
(helicase loader) Dnal 
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SeqID n° 2061 LM-366. 1 From 98409 to 99905 

SeqID n° 2062 LM-3660.2 From 1598432 to 1599808 

SeqID n° 2063 LM-3662.3 From 2708276 to 2709358 

SeqID n° 2064 LM-3663.2 From 2707235 to 2708194 

SeqID n° 2065 LM-3664.1 From 2706415 to 2707215 

SeqID n° 2066 LM-3666.1 From 2705746 to 2706054 

SeqID n° 2067 LM-3667.1 From 2705082 to 270571 1 

SeqID n° 2068 LM-3668.1 From 2704433 to 2705056 

SeqID n° 2069 LM-3669.1 From 2704149 to 2704433 

SeqID n° 2070 LM-367.1 From 97381 to 98412 

SeqID n° 2071 LM-3671 .2 From 2703275 to 2704108 

SeqID n° 2072 LM-3673.2 From 2033587 to 2034528 

SeqID n° 2073 LM-3674.1 From 2032460 to 2033485 

SeqID n° 2074 LM-3676.2 From 2031438 to 2032460 

SeqID n° 2075 LM-3677.2 From 1756864 to 1757673 

SeqID n° 2076 LM-3678.1 From 1757771 to 1758673 

SeqID n° 2077 LM-3679.2 From 1758694 to 1761291 



SeqID n° 2078 LM-368.1 From 97127 to 97369 

SeqID n° 2079 LM-3680.2 From 1761318 to 1761992 

SeqID n° 2080 LM-3681.2 From 1708312 to 1708518 

SeqID n° 2081 LM-3684.3 From 1708857 to 1711268 

SeqID n° 2082 LM-3692.1 From 2161161 to 2162054 

SeqID n° 2083 LM-3693.1 From 2160538 to 2161164 

SeqID n° 2084 LM-3694.1 From 2160043 to 2160432 

SeqID n° 2085 LM-3695.2 From 980177 to 981307 

SeqID n° 2086 LM-3696.1 From 981743 to 982960 

SeqID n° 2087 LM-3697.2 From 982957 to 983646 

SeqID n° 2088 LM-3698.2 From 983768 to 984814 



Unknwon, similar to ATP 

synthase alpha chain 
chromosome replication 

initiation / membrane 
attachment protein DnaB 

Unknown, conserved 
hypothetical lipoprotein 
Unknown, weakly similar to 
E. coli MenA protein 
Unknown, similar to B. 
subtilis YbaF protein 
ribosomal protein S10 
ribosomal protein L3 
ribosomal protein L4 
ribosomal protein L23 
Unknwon, weakly similar to 
ATP synthase delta chain 
ribosomal protein L2 
Unknown, similar to 
ferrichrome binding protein 
Unknown, similar to 
ferrichrome ABC 
transporter (permease) 
Unknown, similar to 
ferrichrome ABC 
transporter (permease) 
unknown, similar to 
hypothetical proteins 
unknown, similar to CDP- 
abequose synthase 
unknown, similar to 
putative membrane 
proteins 
Unknwon, similar to ATP 
synthase C chain 
unknown, similar to 
unknown proteins 
unknown 
leucyl-tRNA synthetase 
Unknown 
unknown 
Unknown, similar to 
unknown proetin 
Unknown, similar to B. 
subtilis ComEC protein 
unknown, hypothetical 
transport protein 
unknown, similar to 
transcription regulator 
unknown, conserved 
hypothetical membrane 
protein 
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SeqlDn°2089 LM-3699.2 From 2658024 to 2658851 

SeqID n° 2090 LM-370.1 From 94764 to 97070 

SeqID n° 2091 LM-3701.3 From 1936764 to 1937603 

SeqID n° 2092 LM-3703.3 From 1935964 to 1936749 

SeqID n° 2093 LM-3704.3 From 1935334 to 1935948 

SeqID n° 2094 LM-3705.1 From 1934765 to 1935298 

SeqID n° 2095 LM-3706.1 From 1934321 to 1934758 

SeqID n° 2096 LM-3707.3 From 1933333 to 1934238 

SeqID n° 2097 LM-3708.2 From 82261 1 to 823531 

SeqID n° 2098 LM-3709.1 From 821886 to 822527 

SeqID n° 2099 LM-3710.1 From 821070 to 821771 

SeqID n° 2100 LM-3711.3 From 820145 to 821035 

SeqID n° 2101 LM-3712.2 From 1767359 to 1767739 

SeqID n° 2102 LM-3713.1 From 1767846 to 1768490 

SeqID n° 2103 LM-3714,1 From 1768520 to 1769389 

SeqID n° 2104 LM-3715.1 From 1769720 to 1770526 

SeqID n° 2105 LM-3716.2 From 1770648 to 1771406 



SeqID n° 2106 LM-3717.2 From 1818007 to 1818516 
SeqID n° 2107 LM-3718.1 From 1817112 to 1817879 



SeqID n° 2108 LM-3719.3 From 1815143 to 1817122 
SeqID n° 2109 LM-3720.2 From 1293088 to 1293654 



Unknown, conserved 
hypothetical protein 

unknwon 
unknown, similar to 
hypothetical proteins 
unknown, similar to 
hypothetical proteins 
unknown, similar to 
hypothetical proteins 
unknown, similar to 
peptidyl methionine 
sulfoxide reductases 
unknown, similar to 
transcriptional regulator 
(PilB family) 
unknown, similar to 
dehydogenases and 
hypothetical proteins 
Unknown, conserved 
hypothetical protein 
Unknown, similar to B. 
subtilis YwnB protein 
Unknown, similar to 
conserved hypothetical 
protein 
Unknown, similar to 
conserved hypothetical 
protein 
unknown, similar to 
conserved hypothetical 
proteins 
unknown, similar to 
deoxyguanosine 
kinase/deoxyadenosine 
kinase(l) subunit 
unknown, similar to 
transport proteins 
unknown, similar to 
aminoglycoside N3 - 
acetyitransferases 
unknown, similar to 

methionine 
aminopeptidases 
Unknown 
Unknown, similar to ABC 
transporter (ATP-binding 

protein) 
Unknown, similar to ABC 
transporter (permease) 
unknown, similar to type-l 
signal peptidase 
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SeqlDn°2110 LM-3722.1 From 1291710 to 1292969 



SeqlDn°2111 LM-3724.1 From 1290241 to 1291524 



SeqlDn°2112 
SeqID n°2113 



LM-3725.2 
LM-3726.3 



From 1289188 
From 1703330 



to 1290126 
to 1703824 



SeqID n° 2114 LM-3727.1 From 1704618 to 1705181 



SeqID n°2115 
SeqID n°2116 
SeqID n°2117 

SeqID n°2118 
SeqID n°2119 



LM-3728.1 
LM-373.1 
LM-3732.2 

LM-3733.3 
LM-3735.3 



From 1705199 

From 88888 
From 1706204 

From 1707168 
From 1645412 



to 1705630 

to 94767 
to 1707088 

to 1707917 
to 1645936 



SeqID n° 2120 LM-3736.3 From 1644106 to 1645191 



SeqID n° 2121 
SeqID n°2122 
SeqID n°2123 
SeqID n°2124 

SeqID n° 2125 

SeqID n° 2126 

SeqID n°2127 

SeqID n°2128 

SeqID n° 2129 
SeqID n°2130 

SeqID n°2131 

SeqID n°2132 

SeqID n°2133 
SeqID n°2134 



LM-3737.1 
LM-3739.2 
LM-374.1 
LM-37462 

LM-3747.1 

LM-37492 

LM-375.1 

LM-3750.2 

LM-37542 
LM-37562 

LM-37572 

LM-3758.2 

LM-3759.2 
LM-376.1 



From 1642865 
From 1641284 

From 88121 
From 1428442 

From 1426766 

From 1425419 

From 86747 

From 356890 

From 360172 
From 1407859 

From 1406742 

From 824936 

From 824423 
From 86323 



to 1643872 
to 1642543 

to 88810 
to 1428939 

to 1428328 

to 1426465 

to 87730 

to 357705 

to 360507 
to 1408818 

to 1407818 

to 826396 

to 824884 
to 86691 



SeqID n° 2135 LM-3763.1 From 1659457 to 1660926 

SeqID n° 2136 LM-3764.1 From 1660923 to 1661363 

SeqID n° 2137 LM-3766.3 From 1661588 to 1662457 

SeqID n° 2138 LM-3767.2 From 2212384 to 2212620 



ATP-dependent Clp 
protease ATP-binding 
subunit CIpX 
trigger factor (prolyl 
isomerase) 
unknown 
unknown, putative 
cellsurface protein 
unknown, similar to 
unknown proteins 
unknown 
Unknown 
similar to translation 
elongation factor 
30S ribosomal protein S2 
Unknown, similar to 
general stress protein 
3-deoxy-D-arabino- 
heptulosonate 7-phosphate 

synthase 
catabolite control protein A 
tyrosyl-tRNA synthetase 

unknwon 
Unknown, similar to N- 
acetyltransferase 
Unknown, similar to 
unknown protein 
Recombination protein 
recA 

Unknwon, similar to 

oxidoreductases 
Unknown, similar to 
transposase 
Unknown 
Unknown, similar to 

unknown protein 
Unknown, similar to 
unknown protein 
Unknown, similar to lysine- 
specific permease 

Unknown 
Unknown, similar to 
transcription regulator 
(merR family) 
Unknown, similar to 
multidrug-efflux transporter 
Unknown, similar to 
transcription regulator 
MarR family 
D-Amino Acid 
Aminotransferase 
unknown 
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SeqlDn°2139 LM-3768.2 From 2213218 to 2215041 



SeqlDn°2140 LM-3769.2 From 2306177 
Seq!Dn°2141 LM-377.1 From 85902 
SeqlDn°2142 LM-3770.2 From 2306833 



SeqlDn°2143 LM-3772.2 From 2307755 

SeqlDn°2144 LM-3774.3 From 991810 

SeqID n° 2145 LM-3775.1 From 990968 

SeqID n° 2146 LM-3776.2 From 990248 



Seq I D n° 2 1 47 LM-3778 .2 From 1 86 1 907 

SeqID n° 2148 LM-3779.3 From 1860200 

SeqID n° 2149 LM-378.1 From 85025 

SeqID n° 2150 LM-3783.2 From 989099 



SeqID n° 2151 LM-3785.2 From 988065 

SeqID n° 2152 LM-379.1 From 84451 

SeqID n° 2153 LM-3790.3 From 1420076 

SeqID n° 2154 LM-3791.2 From 1778415 

SeqID n° 2155 LM-3792.1 From 1777763 

SeqID n° 2156 LM-3793.2 From 1776910 

SeqID n° 2157 LM-3795.3 From 1421454 

SeqID n° 2158 LM-3797.2 From 1422236 

SeqID n° 2159 LM-3798.1 From 1423255 



to 2306716 

to 86228 
to 2307714 



to 2308696 
to 992865 

to 991690 
to 990952 



to 1862251 
to 1861090 
to 85627 
to 990232 



to 988910 
to 84849 
to 1421362 

to 1779482 

to 1778338 

to 1777578 
to 1422185 

to 1423165 
to 1423833 



SeqID n° 2160 LM-380.1 From 82959 to 84437 



Unknown, similar to 
unknown protein 
Unknown 
Unknwon 
Unknown, similar to post- 
translocation molecular 

chaperone 
Unknown, similar to S. 
aureus Cbf1 protein 
unknown, similar to 
UNDECAPRENYL- 
PHOSPHATE N- 
ACETYLGLUCOSAMINYL 
TRANSFERASE 
unknown, similar to 
transcription regulator 
(GntR family) 
GLUCOSAMINE-6- 
PHOSPHATE 
ISOMERASE (EC 5.3.1.10) 
(GLUCOSAMINE-6- 
PHOSPHATE 
DEAMINASE) (GNPDA) 
(GLCN6P DEAMINASE), 
ribosomal protein L19 
internalin C 
Unknwon 
N- 

ACETYLGLUCOSAMINE- 
6-PHOSPHATE 
DEACETYLASE (EC 
3.5.1.25) (GLCNAC 6-P 
DEACETYLASE). 
unknown 
Unknwon 
Unknown, similar to 
putative protease 
unknown, similar to 
hypothetical proteins 
unknown, similar to 
putative transcription 

regulators 
unknown, similar to 
hypothetical proteins 
Unknown, similar to 3- 
ketoacyl-acyl carrier protein 
reductase 
Unknown, similar to 

unknown protein 
Unknown, similar to 
phosphatidylglycerophosph 
ate synthase 
Unknwon 
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SeqID n° 2161 LM-3800.3 From 1423902 to 1425146 



SeqID n° 2162 LM-3801.3 From 984823 to 985722 
SeqID n° 2163 LM-3802.4 From 985743 to 986564 
SeqID n° 2164 LM-3806.3 From 1578625 to 1579425 



SeqID n° 2165 LM-3807.3 From 1577212 to 1578573 



SeqID n° 2166 
SeqID n° 2167 



LM-3808.2 
LM-3809.2 



From 2298927 
From 2298093 



to 2299712 
to 2298863 



SeqID n° 2168 LM-3810.3 From 333534 to 335195 



SeqID n° 2169 
SeqID n° 2170 



LM-3811.3 
LM-3812.3 



From 335350 
From 1662570 



to 336426 
to 1663982 



SeqID n° 2171 LM-3818.1 From 2211280 to 2212245 



SeqID n° 
SeqID n° 


2172 
2173 


LM-3819.1 
LM-382.1 


From 2210609 
From 81661 


to 2211244 
to 82617 


SeqID n° 


2174 


LM-3821.1 


From 2159697 


to 2160053 


SeqID n° 


2175 


LM-3822.1 


From 2149348 


to 2149632 


SeqID n° 


2176 


LM-3823.3 


From 2943120 


to 2943479 


SeqID n° 
SeqID n° 


2177 
2178 


LM-3825.4 
LM-3826.1 


From 2943871 
From 1240358 


to 2944341 
to 1240681 


SeqID n° 2179 


LM-383.1 


From 80892 


to 81530 


SeqID n° 


2180 


LM-3845.2 


From 979762 


to 980064 


SeqID n° 


2181 


LM-3846.1 


From 979059 


to 979529 


SeqID n° 


2182 


LM-3847.1 


From 940316 


to 940696 


SeqID n° 


2183 


LM-3848.1 


From 1357753 


to 1358697 


SeqID n° 


2184 


LM-385.1 


From 79817 


to 80869 


SeqID n° 


2185 


LM-3851.1 


From 1378840 


to 1379901 



Unknown, similar to 
competence-damage 
inducible protein CinA 
unknown 
unknown 
unknown, highly similar to 

cell division inhibitor 
(septum placement) protein 
MinD 
Unknown, similar to 
ribonuclease G 
Unknown 
Unknown, similar to 

unknown protein 
Unknown, similar to 
unknown protein 
Unknown 
Unknown, similar to Xaa- 
His dipeptidase 
Unknown, similar to 
transcription regulator, Lacl 
family 
Unknown 
Unknown, similar to 
phosphoglycerate 
dehydrogenase 
Unknown, similar to 
unknown protein 
class I heat-shock protein 
(chaperonin) GroES 
ribonuclease P protein 
component 
hypothetical protein 
unknown, similar to 

unknown protein 
Unknwon, conserved 
hypothetical protein 
unknown, similar to B. 
subtilis YneR protein 
non-heme iron-binding 

ferritin 
unknown, conserved 
hypothetical protein 
unknown, highly similar to 
riboflavin kinase and FAD 

synthase 
Unknwon, similar to E. coli 
Ada protein (06- 
methylguanine-DNA 
methyltransferase) 
Unknown, similar to 
aminopeptidase P 
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SeqID n° 2186 LM-3853.2 From 501693 to 501959 

SeqID n° 2187 LM-3856.2 From 1953836 to 1954237 

SeqID n° 2188 LM-3857.1 From 1963240 to 1963605 

SeqID n° 2189 LM-3858.2 From 2030744 to 2031196 



SeqID n° 2190 LM-386.1 From 79047 



SeqID n° 2191 

SeqID n° 2192 

SeqID n°2193 

SeqID n° 2194 

SeqID n° 2195 
SeqID n° 2196 
SeqID n° 2197 



LM-3863.2 

LM-3865.2 

LM-3867.1 

LM-3868.2 

LM-3869.2 
LM-387.1 
LM-3871.1 



From 2494891 

From 200094 

From 1890288 

From 1428945 

From 1440748 

From 78362 
From 2100845 



to 79820 

to 2495268 

to 200402 

to 1890653 

to 1429748 

to 1441296 

to 78811 
to 2101366 



SeqID n° 2198 LM-3872.1 From 2100224 to 2100751 



SeqID n° 2199 LM-3873.1 From 2891649 to 2892032 

SeqID n° 2200 LM-3876.1 From 520961 to 521536 

SeqID n° 2201 LM-3878.2 From 640625 to 641215 

SeqID n° 2202 LM-3879.2 From 640300 to 640632 

SeqID n° 2203 LM-388.1 From 78090 to 78374 

SeqID n° 2204 LM-3883.1 From 1664057 to 1664470 



SeqID n° 2205 LM-3886.1 From 1702695 to 1703315 

SeqID n° 2206 LM-3887.1 From 1038336 to 1038557 

SeqID n° 2207 LM-3889.1 From 2524168 to 252491 1 

SeqID n° 2208 LM-389.1 From 77149 to 77406 

SeqID n° 2209 LM-3890.1 From 2523835 to 2524068 

SeqID n° 2210 LM-3891.1 From 1724196 to 1724462 



SeqID n° 221 1 LM-3893.1 From 611240 to 612634 
SeqID n° 2212 LM-3896.1 From 1546608 to 1547885 



Unknown, weakly similar to 

transposase 
unknown, similar to similar 
to RNase HI 
unknown, similar to 
conserved hypothetical 
proteins 
Unknown, similar to 
transcriptional regulator 
(Fur family) 
Unknwon, similar to 
carboxyphosphonoenolpyr 
uvate phosphonomutase 
Unknown, similar to glycine 
cleavage system protein H 
Unknwon, similar to B. 
subtilis SpoVG protein 
Unknown, similar to 

unknown protein 
Unknown, conserved 
hypothetical protein 
Unknown 
Unknown 
Unknown, similar to 
unknown protein 
Unknown, similar to cell- 
division initiation protein 

(septum placement) 
Unknown, hypothetical 
secreted protein 
Unknown 
Unknown 
Unknown, conserved 
hypothetical protein 
unknown 
Unknown, weakly similar to 
E. coli MutT protein (dGTP 
pyrophosphohydrolase 
unknown, putative 
cellsurface protein 

unknown 
Unknown, similar to 
carboxylesterase 
unknown 
Unknown 
unknown, similar to 
conserved hypothetical 
proteins 
Unknown, similar to beta- 
glucosidase 
Unknown, similar to 
unknown protein 
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SeqID n° 2213 LM-3897.3 From 1514257 to 1515288 

SeqID n° 2214 LM-3898.1 From 1594535 to 15951 19 

SeqID n° 2215 LM-39.1 From 2736597 to 2737628 

SeqID n° 2216 LM-390.3 From 76262 to 76984 

SeqID n° 2217 LM-3902.4 From 1715236 to 1716435 

SeqID n° 2218 LM-3904.3 From 819389 to 820039 

SeqID n° 221 9 LM-3905.2 From 502899 to 504629 

SeqID n° 2220 LM-3907.1 From 1234779 to 1235330 

SeqID n° 2221 LM-3908.1 From 1234465 to 1234776 

SeqID n° 2222 LM-3910.2 From 1409070 to 1409996 

SeqID n° 2223 LM-3912.1 From 2709522 to 2710421 

SeqID n° 2224 LM-3914.1 From 2297304 to 2297981 

SeqID n° 2225 LM-3915.1 From 2145127 to 2145513 



SeqID n° 2226 
SeqID n° 2227 
SeqID n° 2228 
SeqID n° 2229 
SeqID n° 2230 
SeqID n°2231 



LM-392.3 
LM-3922.2 
LM-3924.2 
LM-3929.1 

LM-393.3 
LM-3934.1 



From 75832 
From 987735 
From 986722 
From 833615 

From 75342 
From 356567 



to 76125 
to 988049 
to 987180 
to 834076 

to 75665 
to 356872 



SeqID n° 2232 LM-3942.3 From 2633930 to 2634850 

SeqID n° 2233 LM-3943.4 From 1576107 to 1576397 

SeqID n° 2234 LM-3944.4 From 1576411 to 1576728 

SeqID n° 2235 LM-3947.1 From 1599814 to 1600278 

SeqID n° 2236 LM-3949.2 From 1638420 to 1638908 

SeqID n° 2237 LM-395.2 From 74239 to 75228 



SeqID n° 2238 LM-3950.2 

SeqID n° 2239 LM-3951.1 

SeqID n° 2240 LM-3953.1 

SeqID n° 2241 LM-3954.2 



From 1639120 to 1639722 

From 1640225 to 1641004 

From 502424 to 502660 

From 504674 to 504997 



Unknown, similar to 

unknown protein 
Unknown, similar to 
hypothetical GTP binding 

protein 
Unknown, similar to polyol 
dehydrogenase 
unknown 
Unknown, similar to S- 

methionine 
adenosyltransferase 
Unknown 
Unknown 
unknown 
unknown, similar to 

unknown protein 
Unknown, similar to 

unknown protein 
Unknown, conserved 

lipoprotein 
Unknown, similar to 
unknown protein 
Unknown, similar to large 

conductance 
mechanosensitive channel 
protein 
unknown 
Unknown 
Unknown 
unknown 
Unknwon 
Unknown, similar to 

transposase 
Unknown, conserved 
hypothetical protein 
ribosomal protein L27 
Unknown, similar to 

unknown protein 
Unknown, similar to 

unknown protein 
Unknown, similar to 

unknown protein 
Unknown, similar to 
dinitrogenase reductase 
ADP-ribosylation system 
ribosomal protein S4 
Unknown 
Hypothetical orf 
Unknown 
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SeqID n° 2242 


LM-3955.1 


From 1307763 


to 1308170 


SeqID n° 2243 
SeqID n° 2244 


LM-3957.1 
LM-3958.1 


From 2360908 
From 2360435 


to 2361141 
to 2360713 


SeqID n° 2245 
SeqID n° 2246 


LM-3959.2 
LM-396.2 


From 2359948 
From 1859272 


to 2360184 
to 1859787 


SeqID n° 2247 


LM-3961.1 


From 2494522 


to 2494806 


SeqID n 2248 


LM-3966.4 


From 1739674 


to 1740849 


beqiD n 2249 
SeqID n° 2250 


LM-397.3 
LM-3970.1 


From 1858651 
From 1933037 


to 1859010 
to 1933270 


SeqID n° 2251 


LM-3972.2 


From 818789 


to 819265 


SeqID n 2252 


LM-3973.2 


From 1136900 


to 1137400 


SeqID n 2253 


LM-3976.4 


From 267530 


to 267916 


SeqID n° 2254 
SeqID n 2255 
SeqID n° 2256 


LM-3978.3 
LM-3979.3 
LM-398.1 


From 267010 
From 266431 
From 1857856 


to 267372 
to 266931 
to 1858611 


beqID n 2257 


LM-3981.1 


From 2361897 


to 2362274 


oeqiu n 2258 


LM-3982.1 


From 2362306 


to 2362656 


SeqID n° 2259 


LM-3984.1 


From 270372 


to 270977 


SeqID n° 2260 
SeqID n° 2261 


LM-399.1 
LM-3990.1 


From 1857242 
From 222983 


to 1857772 
to 223261 


SeqID n° 2262 


LM-3993.1 


From 200522 


to 200830 


SeqID n° 2263 
SeqID n° 2264 


LM-3995.1 
LM-3998.1 


From 143016 
From 823638 


to 143258 
to 824168 


SeqID n° 2265 


LM-3999.3 


From 810985 


to 81 1620 


SeqID n° 2266 


LM-4.1 


From 2713398 


to 2713940 



unknown, conserved 
hypothetical protein, similar 
to B. subtilis YneT protein 

Unknown 
Unknown, similar to 
competence transcription 
factor ComK, N terminal 
part 
Unknown 
translation initiation factor 
IF-3 

Unknown, similar to 

thioredoxin 
unknown, similar to 
transmembrane transport 
proteins 
ribosomal protein L20 
unknown, similar to 
hypoyhetical protein 
Unknown, similar to 
transcription regulator 
(EbsC from Enterococcus 
faecalis) 
Unknown, similar to 
lipoprotein signal peptidase 
Unknown, similar to 
repressor (penicilinase 

repressor) 
ribosomal protein L12 
ribosomal protein L10 
Unknown, similar to 3 -exo- 
deoxyribonuclease exoA 
protein gp30 
[Bacteriophage A1 18] 

protein gp29 
[Bacteriophage A1 18] 
Unknown, conserved 
hypothetical protein 
Unknown 
Unknwon, highly similar to 
B. subtilis YabO protein 
Unknown, similar to B. 
subtilis SpoVG protein 

unknown 
Unknown, conserved 
hypothetical protein 
Unknown, similar to acyl- 

carrier protein 
phosphodiesterase and to 
NAD(P)H dehydrogenase 
Unknown 
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SeqID n° 2267 LM-40.1 From 2737630 to 2738682 

SeqID n° 2268 LM-4003.2 From 1513917 to 1514171 
SeqlDn°2269 LM-401.1 From 1855952 to 1857184 



SeqlDn°2270 LM-4013.2 From 1926482 to 1926898 



SeqID n° 2271 LM-4015.1 From 1771594 to 1772040 



SeqlDn°2272 LM-402.1 From 1855571 
SeqID n° 2273 LM-4027.2 From 263535 



SeqID n° 2274 LM-4029.2 From 263844 
SeqID n° 2275 LM-403.1 From 1854748 



SeqID n° 2276 LM-4031.1 From 264432 

SeqID n° 2277 LM-4032.1 From 265029 

SeqID n° 2278 LM-4033.1 From 265494 

SeqID n° 2279 LM-404.1 From 1854077 

SeqID n° 2280 LM-4040.1 From 504972 

SeqID n° 2281 LM-4041.1 From 552563 

SeqID n° 2282 LM-4053.1 From 1270465 

SeqID n° 2283 LM-4055.2 From 1288261 



SeqID n° 2284 
SeqID n° 2285 
SeqID n° 2286 



SeqID 
SeqID 
SeqID 
SeqID 
SeqID 
SeqID 
SeqID 
SeqID 
SeqID 
SeqID 



n° 2287 
n° 2288 
n° 2289 
n° 2290 
n° 2291 
n° 2292 
n° 2293 
n° 2294 
n° 2295 
n° 2296 



LM-4058.1 
LM-4059.1 
LM-4065.1 



LM-4082.1 
LM-4083.1 
LM-4084.1 
LM-4088.1 
LM-4089.1 
LM-4090,1 
LM-4091.1 
LM-4096.1 
LM-4097.1 
LM-4098.1 



From 1405945 
From 1406366 
From 2045765 



From 2782739 
From 2780577 
From 2780392 
From 2701254 
From 2700963 
From 2699383 
From 2697267 
From 367248 
From 370416 
From 389538 



to 1855933 
to 263714 



to 264377 
to 1855521 



to 264902 
to 265454 
to 266183 
to 1854682 

to 505277 
to 552931 

to 1270725 
to 1288926 



to 1406226 
to 1406713 
to 2046046 



to 2782894 
to 2780798 
to 2780574 
to 2701445 
to 2701226 
to 2699568 
to 2697446 
to 367532 
to 370634 
to 389717 



SeqID n° 2297 LM-4099.1 From 401386 to 401580 



Unknown, similar to sorbitol 
dehydrogenase 
ribosomal protein S20 
Unknown, similar to 
aminotripeptidase 

(peptidase T) 
unknown, similar to 
transcriptional regulator 
(MarR family) 
unknown, similar to 
putative flavodoxin 
Unknown 
unknown, highly similar to 
preprotein translocase 
subunit 
transcription 
antitermination factor 
Unknown, similar to ABC 
transporter (ATP-binding 
protein) 
Unknown 
ribosomal protein L1 1 
ribosomal protein L1 
Unknown, similar to 
unknown protein 
Unknown 
Unknown, similar to 
unknown protein 
unknown 
unknown, weakly similar to 
oligopeptide ABC 
transporter AppA 
Unknown 
Unknown 
Unknown, similar to pentitol 
PTS system enzyme II B 
component 
Unknown 
Unknown 
Unknown 
ribosomal protein L29 
ribosomal protein S17 
ribosomal protein S14 
ribosomal protein L30 
Unknown 
Unknown 
Unknown, similar to 
conserved hypothetical 
protein 
unknown 
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SeqlDn°2298 LM-41.1 From 2738719 to 2739990 



SeqlDn°2299 LM-4106.1 From 439009 to 439212 



SeqID n° 2300 LM-41 13.1 From 521601 
SeqlDn°2301 LM-41 15.1 From 616742 



SeqID n° 2302 LM-41 18.1 

SeqID n° 2303 LM-4119.1 

SeqID n° 2304 LM-4120.1 

SeqID n° 2305 LM-4121.1 

SeqID n° 2306 LM-41 22.1 



From 669086 
From 696606 
From 707310 
From 709601 
From 731765 



to 521771 
to 616951 

to 669361 
to 696815 
to 707486 
to 709810 
to 732001 



SeqID n° 2307 LM-4123.1 From 756079 to 756291 



SeqID n° 2308 
SeqID n° 2309 



LM-4124.1 
LM-41 30.1 



From 756494 
From 890069 



to 756637 
to 890215 



SeqID n° 2310 LM-4134.1 From 961203 to 961469 



SeqID n° 2311 
SeqID n° 2312 
SeqID n° 2313 
SeqID n° 2314 
SeqID n° 2315 



LM-41 35.1 
LM-4137.1 
LM-4139.1 
LM-4142.1 
LM-4144.1 



From 973509 
From 981317 
From 987274 
From 1003593 
From 1057067 



SeqID n° 2316 LM-4147.1 From 1133466 

SeqID n° 2317 LM-4148.1 From 1145723 

SeqID n° 2318 LM-4149.2 From 1154864 

SeqID n° 2319 LM-415.1 From 1848166 



to 973670 
to 981544 
to 987501 
to 1003829 
to 1057276 

to 1133696 

to 1 145944 

to 1156381 
to 1848690 



SeqID n° 2320 LM-4150.1 From 1188014 to 1188289 

SeqID n° 2321 LM-4152.2 From 1287376 to 1287591 

SeqID n° 2322 LM-4153.1 From 1331439 to 1331666 

SeqID n° 2323 LM-41 54.1 From 1363826 to 1363975 

SeqID n° 2324 LM-4155.1 From 1366220 to 1366432 

SeqID n° 2325 LM-4157.1 From 1385704 to 1385931 

SeqID n° 2326 LM-4158.1 From 1387014 to 1387214 

SeqID n° 2327 LM-4161.1 From 1501881 to 1502054 



Unknwon, similar to PTS 
system galactitol-specific 
enzyme IIC component 
Unknown, similar to 
putative transcription 

regulator 
ribosomal protein L32 
Unknown, similar to 
unknown protein 
Unknown 
unknown 
unknown 
unknown 
Unknown, weakly similar to 
flagellar switch protein 
unknown, LPXTG motif 
protein 
Hypothetical CDS 
Unknown, hypothetical 
protein 
Unknown, similar to ABC 
transporter, ATP-binding 
protein (C-terminal part) 
unknown 
unknown 
Unknown 
D-alanyl carrier protein 
unknown, similar to B. 
subtilis YkzG protein 
unknown, highly similar to 

TN916 0RF8 
unknown, highly similar to 
TN916 0RF19 
unknown 
Unknown, similar to 

unknown protein 
Unknwon, similar to 
carboxysome structural 
protein 
unknown, similar to 
transcriptional regulator 
unknown, similar to B. 
subtilis YnzC protein 
ribosomal protein L33 
unknown similar to B. 

subtilis yqgQ 
Unknown, similar to 
exodeoxyribonuclease 

small subunit 
Similar to cold shock 
protein 
30S ribosomal protein S21 
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SeqlDn°2328 LM-4162.2 From 1576747 to 1577055 
SeqID n° 2329 LM-4166.1 From 1654691 to 1654909 

SeqlDn°2330 LM-417.1 From 1847472 to 1847960 



SeqID n° 2331 
SeqID n° 2332 
SeqID n° 2333 
SeqID n° 2334 
SeqID n° 2335 



LM-4172.1 
LM-4174.1 
LM-4175.1 
LM-4179.1 
LM-418.1 



From 1756629 
From 1764871 
From 1769479 
From 1859050 
From 1846355 



SeqID n° 2336 LM-4180.1 From 1867180 
SeqID n° 2337 LM-4181.1 From 1881041 



SeqID n° 2338 
SeqID n° 2339 
SeqID n° 2340 



LM-4182.1 
LM-4183.1 
LM-4184.1 



From 1890951 
From 1902281 
From 1928579 



SeqID n° 2341 LM-4186.1 From 1953513 
SeqID n° 2342 LM-4189.1 From 2094877 



SeqID n° 2343 
SeqID n° 2344 
SeqID n° 2345 
SeqID n° 2346 
SeqID n° 2347 
SeqID n° 2348 



LM-419.1 
LM-4192.1 
LM-4 193.1 
LM-4195.1 
LM-4197.1 

LM-420.1 



From 
From 
From 
From 
From 
From 



1845044 
2144874 
2150556 
2184120 
2192379 
1844250 



SeqID n° 2349 LM-4200.1 From 2234357 
SeqID n° 2350 LM-4201.1 From 2242097 



to 1756862 
to 1765077 
to 1769700 
to 1859250 
to 1847479 

to 1867410 

to 1881274 

to 1891139 
to 1902484 
to 1928785 

to 1953713 

to 2095077 

to 1846336 
to 2145065 
to 2150759 
to 2184347 
to 2192609 
to 1844963 



to 2234533 
to 2242282 



SeqID n° 2351 LM-4203.1 From 2293497 to 2293685 



SeqID n° 2352 
SeqID n° 2353 
SeqID n° 2354 



LM-4206.1 
LM-4207.1 
LM-4208.1 



From 2346593 
From 2361582 
From 2366288 



SeqID n° 2355 LM-4209.1 From 2387499 
SeqID n° 2356 LM-421.1 From 1843993 



SeqID n° 2357 
SeqID n° 2358 
SeqID n° 2359 
SeqID n° 2360 
SeqID n° 2361 
SeqID n° 2362 



LM-4210.1 
LM-4211.1 
LM-4212.1 
LM-4213.1 
LM-4214.1 
LM-4215.1 



From 
From 
From 
From 
From 
From 



2388528 
2389208 
2389491 
2391020 
2394790 
2395586 



to 2346784 
to 2361791 
to 2366446 

to 2387663 

to 1844238 

to 2388710 
to 2389387 
to 2389658 
to 2391217 
to 2394984 
to 2395801 



ribosomal protein L21 
Unknown, hypothetical 
gene 

hosphoribosylaminoimidaz 
ole carboxylase I 
unknown 
unknown 
unknown 
ribosomal protein L35 
phosphoribosylaminoimida 
zole carboxylase II 
Unknown, similar to 
unknown protein 
unknown, highly similar to 
acyl carrier proteins 
ribosomal protein L28 
unknown 
unknown, similar to 
putative mercuric ion 

binding proteins 
similar to cold shock 
protein 
similar to major cold-shock 
protein 
adenylosuccinate lyase 
unknown 
unknown 
Unknown 
Unknown 
phosphoribosylaminoimida 
zole succinocarboxamide 
synthetase 
unknown 
unknown, similar to B. 
subtilis YwmG protein 
Unknown, similar to 
unknown protein 
unknown, 
unknwon 
protein gp22 
[Bacteriophage A1 18] 
Bacteriophage A1 18 gp65 
protein 
Unknown, similar to 

unknown protein 
Hypothetical protein 
Unknown 
unknown 
Unknown 
Unknown 
gp44 [Bacteriophage A1 18] 
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SeqID n° 2363 LM-4216.1 From 2398287 to 2398529 
SeqID n° 2364 LM-422.1 From 1843306 to 1843989 

SeqID n° 2365 LM-4225.1 From 2467388 to 2467621 



SeqID n° 2366 
SeqID n° 2367 

SeqID n° 2368 
SeqID n° 2369 

SeqID n° 2370 

SeqID n° 2371 

SeqID n° 2372 

SeqID n° 2373 
SeqID n° 2374 

SeqID n° 2375 
SeqID n° 2376 

SeqID n° 2377 

SeqID n° 2378 

SeqID n° 2379 

SeqID n° 2380 



LM-4226.1 
LM-4227.1 

LM-4228.1 
LM-423.1 

LM-4230.1 

LM-4231.1 

LM-4233.1 

LM-4234.1 
LM-4235.1 

LM-4236.1 
LM-4246.1 

LM-4247.1 

LM-4248.1 

LM-4249.1 

LM-425.1 



From 2468545 
From 2480325 

From 2501306 
From 1841094 

From 2559617 

From 2567696 

From 2611192 

From 2624633 
From 2643753 

From 2646464 
From 145354 

From 145085 

From 136714 

From 136469 

From 1839682 



Seq I D n° 238 1 LM-425 1.1 From 77863 

SeqID n° 2382 LM-4253.1 From 52373 

SeqID n° 2383 LM-4254.1 From 50514 

SeqID n° 2384 LM-426.1 From 1838614 



to 2468703 
to 2480528 

to 2501470 
to 1843313 

to 2559817 

to 2567935 

to 261 1410 

to 2624878 
to 2643938 

to 2646664 
to 145560 

to 145171 

to 136992 

to 136702 

to 1841109 



to 78066 
to 52534 
to 50753 
to 1839663 



SeqID n° 2385 LM-4262.2 From 267909 to 268949 

SeqID n° 2386 LM-4267.1 From 1153010 to 1153783 

SeqID n° 2387 LM-4268.1 From 1153848 to 1154204 

SeqID n° 2388 LM-427.1 From 1838051 to 1838617 



SeqID n° 2389 LM-4276.2 From 2943569 to 2943703 
SeqID n° 2390 LM-4277.1 From 2693947 to 2694060 



Unknown, similar to 
transcription regulator 
Unknown, similar to 
phosphoribosylformylglycin 
amidine synthetase II 
Unknown, similar to B. 
subtilis YuzB protein 

Unknown 
Unknown, similar to 
repressor protein 
Unknown 
phosphoribosylformylglycin 
amidine synthetase I 
Unknown, similar to B. 

subtilis yvIC protein 
Unknown, similar to B. 
subtilis CsbA protein 
unknown, highly similar to 
H+-transporting ATP 

synthase chain c 
ribosomal protein L31 
Unknwon, similar to 4- 
oxalocrotonate isomerase 
unknown 
Unknwon, hypothetical 

protein 
Unknwon, hypothetical 
protein 
Unknwon, similar to E. coli 

YjdJ protein 
Unknown, similar to E. coli 
Yjdl protein 
glutamine 
phosphoribosylpyrophosph 
ate amidotransferase 
Unknown, Hypothetical 
Unknown 
30S ribosomal protein S18 
phosphoribosylaminoimida 
zole synthetase 
Unknown, similar to 
penicillinase antirepressor 
unknown, similar to 
regulatory proteins 
unknown 
unknown, highly similar to 
phosphoribosylglycinamide 
formyltransferases 
ribosomal protein L34 
ribosomal protein L36 
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SeqID n° 2391 LM-428.1 From 1836516 to 1838045 



SeqID n° 2392 LM-429,1 From 1835229 to 1836491 

SeqID n° 2393 LM-4293,1 From 2130228 to 2130401 

SeqID n° 2394 LM-4295. 1 From 2052696 to 2052833 

SeqID n° 2395 LM-43.1 From 2740052 to 2740333 



SeqID n° 2396 LM-430. 1 From 1834795 to 1835091 

SeqID n° 2397 LM-432. 1 From 1 834523 to 1 834777 
SeqID n° 2398 LM-433.1 From 1833125 to 1834480 



SeqID n° 2399 LM-4342.1 From 2317644 to 2317841 

SeqID n° 2400 LM-4349.1 From 1287710 to 1288243 
SeqID n° 2401 LM-435.1 From 1832242 to 1832919 



SeqID n° 2402 
SeqID n° 2403 



LM-4352.1 From 436161 
LM-4353.1 From 2250846 



SeqID n 6 2404 LM-4358.1 From 2491285 

SeqID n° 2405 LM-4359.1 From 2173635 

SeqID n° 2406 LM-436.2 From 1829982 

SeqID n° 2407 LM-4360.1 From 263366 

SeqID n° 2408 LM-438. 1 From 1 827941 

SeqID n° 2409 LM-439.1 From 1826829 

SeqID n° 2410 LM-44.1 From 2740390 



to 436352 
to 2251004 
to 2491473 
to 2173832 
to 1832177 

to 263515 
to 1829956 

to 1827944 

to 2740854 



SeqID n° 2411 LM-440.1 From 1826411 to 1826704 

SeqID n° 2412 LM-441.1 From 1824936 to 1826387 

SeqID n° 2413 LM-442.1 From 1823494 to 1824924 

SeqID n° 2414 LM-443.2 From 1822421 to 1823353 

SeqID n° 2415 LM-444.2 From 1821518 to 1822273 



Bifunctional 
phosphoribosylaminoimida 
zole carboxy formyl 
formyltransferase and 
inosine-monophosphate 
cyclohydrolase 

phosphoribosylglycinamide 
synthetase 
ribosomal protein L32 
Unknown 
Unknwon, similar to PTS 
system galactitol-specific 
enzyme IIB component 
Unknown, similar to 
unknown protein 
Unknown 
Unknown, similar to 
putative sodium-dependent 
transporter 
Unknown, similar to 
unknown protein 
Unknown 
Unknown, similar to 
unknown protein 
Unknown 
Unknown 
unknown 

ATP-dependent DNA 
helicase 

Unknown, similar to DNA 
ligase 
Unknown, similar to 
unknown protein 
Unknwon, similar to PTS 
system galactitol-specific 
enzyme IIA component 
glutamyl-tRNA(Gln) 
amidotransferase (subunit 
C) 

glutamyl-tRNA(Gln) 
amidotransferase (subunit 
A) 

glutamyl-tRNA(Gln) 
amidotransferase (subunit 
B) 

Unknown, similar to 
unknown protein 
Unknown 
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SeqID n° 2416 LM-445.1 From 1820133 to 1821494 



SeqIDn 0 2417 
SeqID n° 2418 
SeqID n° 2419 



LM-446. 1 From 1 81 9244 to 1 81 9798 
LM-447.2 From 1818740 to 1819213 
LM-45.2 From 2740888 to 2742957 



SeqID n° 2420 LM-451 . 1 From 2435023 to 2436141 



SeqID n° 2421 
SeqID n° 2422 



LM-452. 1 From 2434627 to 2434866 
LM-453. 1 From 2433224 to 2434618 



SeqID n° 2423 LM-455.1 From 2431688 to 2433211 



SeqID n° 2424 
SeqID n° 2425 
SeqID n° 2426 
SeqID n° 2427 



LM-456. 1 From 2430826 to 2431296 

LM-458. 1 From 242801 0 to 2430793 

LM-460. 1 From 2427045 to 2427890 

LM-463. 1 From 2426270 to 2427001 



SeqID n° 2428 LM-464.1 From 2425723 to 2426253 



SeqID n° 2429 
SeqID n° 2430 



LM-466. 1 From 2424922 to 2425533 
LM-468.1 From 2423480 to 2424721 



SeqID n° 2431 LM-469.1 From 2421878 to 2423476 



SeqID n° 2432 
SeqID n° 2433 



LM-47.2 From 2743326 to 27441 38 
LM-471.1 From 2419797 to 2421749 



SeqID n° 2434 LM-472.1 From 2418871 to 2419770 



SeqID n° 2435 
SeqID n° 2436 
SeqID n° 2437 



LM-473. 1 From 241 8083 to 241 8628 
LM-475.1 From 2417531 to 2418067 
LM-476.1 From 2416705 to 2417514 



SeqID n° 2438 LM-477.1 From 2415967 to 2416683 



Unknown, similar to 
hypothetical RNA 
methyltransferase 
Unknown, similar to 

unknown protein 
Unknown, similar to 

shikimate kinase 
unknown, similar to 
transcriptional 
antiterminator (BgIG family) 

Unknown, similar to S. 
pyogenes RofA regulatory 
protein 
Hypothetical protein 
Unknown, similar to 
glutamate decarboxylase 
Unknown, similar to amino 
acid antiporter (acid 

resistance) 
Unknown, conserved 
hypothetical protein 
Unknown, transmembrane 
protein 
Unknown, conserved 
hypothetical protein 
Unknown, similar to N- 
acetylgiucosamine-6- 
phosphate isomerase 
Unknown, similar to 
unknown protein 
Unknown 
Unknown; Similar to 
multidrug resistance 

protein 
Unknown, conserved 
hypothetical protein 

Unknown 
Unknown, similar to 
putative Na+/H+ antiporter 
Unknown, similar to LysR 
family transcription 
regulator 
Unknown, similar to NADH- 
dependent FMN reductase 
Unknown, similar to B. 
subtilis Ytml protein 
Unknown, similar to amino 
acid ABC transporter 
(binding protein) 
Unknown, similar to amino 
acid ABC-transporter 
(permease) 
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SeqID n° 2439 LM-478.1 From 2415246 to 2415953 



SeqID n° 2440 
SeqID n° 2441 

SeqtD n° 2442 
SeqID n° 2443 
SeqID n° 2444 



LM-48.1 From 2744212 to 2744574 

LM-480.1 From 2414470 to 2415249 

LM-481.1 From 2413478 to 2414473 

LM-482.1 From 2413219 to 2413485 

LM-483.1 From 241 1900 to 2413222 



SeqID n° 2445 LM-484.1 From 241 1 126 to 241 1827 



SeqID n° 2446 
SeqID n°2447 



LM-485. 1 From 2409875 to 241 0993 
LM-486. 1 From 2408967 to 2409878 



SeqID n° 2448 LM-487.1 From 2408450 to 2408893 



SeqID n° 2449 
SeqID n° 2450 
SeqID n° 2451 



SeqID n° 2452 
SeqID n° 2453 

SeqID n° 2454 
SeqID n° 2455 
SeqID n° 2456 
SeqID n° 2457 

SeqID n° 2458 
SeqID n° 2459 



LM-488.1 From 2407128 to 2408453 
LM-49.1 From 2744571 to 2744939 
LM-490. 1 From 24061 61 to 240691 3 



LM-491.1 From 2405241 to 2406164 

LM-492.1 From 2403341 to 2405239 

LM-494.1 From 2402888 to 2403235 

LM-495. 1 From 2402357 to 2402833 

LM-496.1 From 2401008 to 2402366 

LM-497. 1 From 2400264 to 2400944 

LM-498. 1 From 2399542 to 2400243 

LM-5. 1 From 271 3962 to 271 4942 



SeqID n° 2460 LM-500.1 From 2398685 to 2399161 



Unknown, similar to amino 
acid ABC transporter 

(permease) 
Unknown, conserved 
hypothetical protein 
Unknown, similar to amino 
acid ABC-transporter, ATP- 
binding protein 
Unknown, conserved 
hypothetical protein 
Unknown, similar to B. 
subtilis Ytnl protein 
Unknown, similar to 
nitrilotriacetate 
monooxygenase 
Unknwon, similar to 16S 
pseudouridylate synthase 
Similar to kinases 
Unknown, similar to 
Erwinia chrysanthemi IndA 
protein 
Unknown, conserved 
hypothetical protein 
aminopeptidase C 

Unknown 
Unknown, similar to 
regulatory protein DeoR 
family 

fructose- 1 -phosphate 
kinase 

unknown, highly similar to 
phosphotransferase 
system (PTS) fructose- 
specific enzyme IIABC 
component 
Unknown, similar to 
transcriptional regulator 
competence transcription 
factor 
putative integrase 
[Bacteriophage A1 18] 
Unknown, weakly similar to 
gp32_Bacteriophage A1 18 

protein 
Unknown, similar to protein 
gp33 [Bacteriophage A118] 
Unknown, similar to 
heptaprenyl diphosphate 
synthase component II 
Unknown, similar to a 
putative repressor protein 
[Bacteriophage A1 18] 
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SeqID n° 2461 
SeqID n° 2462 



LM-502.1 From 2397800 to 2398084 
LM-503. 1 From 2397493 to 2397774 



SeqID n° 2463 LM-506. 1 From 2396454 to 2397230 



SeqID n° 2464 
SeqID n° 2465 



LM-507. 1 From 2395798 to 2396331 
LM-509. 1 From 239431 7 to 2394793 



SeqID n° 2466 LM-51 .1 From 2744985 to 2745791 



SeqID n° 2467 
SeqID n° 2468 



LM-510.1 From 2393613 to 2394311 
LM-51 2.1 From 2392622 to 2393596 



SeqID n° 2469 LM-513.1 From 2391813 to 2392625 



SeqID n° 2470 

SeqID n° 2471 

SeqID n° 2472 
SeqID n° 2473 

SeqID n° 2474 
SeqID n° 2475 



SeqID n° 2476 
SeqID n° 2477 
SeqID n° 2478 

SeqID n°2479 
SeqID n° 2480 



SeqID n° 2481 
SeqID n° 2482 

SeqID n° 2483 
SeqID n° 2484 
SeqID n° 2485 



LM-514.1 From 2391220 to 2391816 

LM-517.1 From 2390580 to 2391023 

LM-51 8.1 From 23901 13 to 2390583 

LM-52. 1 From 2745907 to 2746377 

LM-520. 1 From 2389655 to 23901 16 

LM-522.1 From 2388729 to 2389211 



LM-523. 1 From 23881 79 to 2388583 

LM-524. 1 From 2387792 to 23881 75 

LM-525. 1 From 2387046 to 2387480 

LM-527. 1 From 23861 41 to 2386680 

LM-529.1 From 2385301 to 2386095 



LM-53. 1 From 2746423 to 2746878 

LM-530.1 From 2384001 to 2385332 

LM-53 1 . 1 From 2382228 to 2383988 

LM-532. 1 From 2381 088 to 2382227 

LM-533.1 From 2380419 to 2381009 



SeqID n° 2486 LM-535.1 From 2379418 to 2380419 



Unknown 
Unknown, similar to protein 
gp41 [Bacteriophage A1 18] 
Unknown, similar to anti- 
repressor [Bacteriophage 
A118] 

gp43 [Bacteriophage A118] 
Unknown, similar to 
bacteriophage proteins 
Unknown, weakly similar to 
AraC-like transcription 
regulator 
Unknown 
Unknown, similar to protein 
gp49 [Bacteriophage A1 18] 
Unknown, similar to site- 
specific DNA- 
methyltransferase 
Unknown, similar to protein 
gp51 [Bacteriophage A1 18] 
Unknown, similar to a 
bacteriophage protein 

Unknown 
Unknown, conserved 
hypothetical protein 
Unknown 
unknown, similar to single- 
stranded DNA-binding 
protein 
Unknown 
unknown 
Protein gp66 
[Bacteriophage A1 18] 

Unknown 
Unknown, similar to 
putative terminase small 
subunit from Bacteriophage 
A118 

Unknown, similar to ribose 
5-phosphate epimerase 
putative terminase large 

subunit from Bacteriophage 
A118 

putative portal protein 
[Bacteriophage A1 18] 
Protein gp4 [Bacteriophage 
A118] 
Unknown, putative 
scaffolding protein 
[Bacteriophage A1 18] 
Unknown, similar to coat 
protein [Bacteriophage 
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oeqiu n z4ov 


LM-536.1 


From 2379004 


to 2379399 


oeqiu n z4oo 


LM-537.1 


From 2378642 


to 2379004 


SeqID n° 2489 


LM-538.1 


From 2378304 


to 2378642 


SeqID n° 2490 


LM-539.1 


From 2377897 


to 2378304 


SeqID n° 2491 
oeqiu n Z4yz 


LM-54.1 
LM-540.1 


From 2747073 
From 2377460 


to 2747414 
to 2377894 


oeqiu n ^4yo 


LM-041 .1 


From 23771 98 


to 2377530 


SeqID n° 2494 


LM-542.1 


From 2376721 


to 2377143 


SeqID n° 2495 


LM-543.1 


From 2376110 


to 2376715 


SeqID n° 2496 


LM-548.1 


From 2370736 


to 2376099 



SeqID n° 2497 

SeqID n° 2498 

SeqID n° 2499 

SeqID n° 2500 

SeqID n° 2501 

SeqID n° 2502 

SeqID n° 2503 

SeqID n° 2504 
SeqID n° 2505 

SeqID n° 2506 
SeqID n° 2507 



LM-549.1 

LM-55.1 

LM-550.1 

LM-552.1 

LM-553.1 

LM-554.1 

LM-555.1 

LM-557.1 
LM-558.1 

LM-559.1 
LM-56.1 



From 2369916 

From 2747427 

From 2368882 

From 2367853 

From 2366780 

From 2366451 

From 2365893 

From 2365599 
From 2364754 

From 2363709 
From 2748685 



to 2370734 

to 2748683 

to 2369907 

to 2368881 

to 2367853 

to 2366768 

to 2366258 

to 2365880 
to 2365599 

to 2364260 
to 2749497 



SeqID n° 2508 LM-560.1 From 2363135 to 2363632 

SeqID n° 2509 LM-562.3 From 2362661 to 23631 10 

SeqID n° 2510 LM-563.3 From 318 to 1673 

SeqID n° 2511 LM-564.1 From 1867 to 3012 



SPP1] 

Protein gp8 [Bacteriophage 
A118] 

Protein gp9 [Bacteriophage 
A118] 
Protein gp1 0 
[Bacteriophage A1 18] 

Portein gp1 1 
[Bacteriophage A1 18] 

Unknown 
major tail shaft protein 
[Bacteriophage A1 18] 

Portein gp13 
[Bacteriophage A1 18] 

Protein gp14 
[Bacteriophage A1 18] 

Protein gp15 
[Bacteriophage A1 18] 
Unknown, putative tape- 
measure [Bacteriophage 
A118] 
Protein gp17 
[Bacteriophage A1 18] 
Unknown, similar to UV- 
damage repair protein 

Protein gp18 
[Bacteriophage A1 18] 

Protein gp19 
[Bacteriophage A1 18] 

protein gp20 
[Bacteriophage A1 18] 

protein gp21 
[Bacteriophage A1 18] 

protein gp23 
[Bacteriophage A1 18] 
holin [Bacteriophage A118] 
L-alanoyl-D-glutamate 
peptidase 
Unknown 
Unknown, similar to 
hydrolase (esterase) 
Unknown, similar to an 
unknown bacteriophage 
protein 
Portein gp28 
[Bacteriophage A1 18] 
Chromosomal replication 

initiation protein DnaA 
DNA polymerase 111, beta 
chain 
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SeqID n° 2512 


LM-565.1 


From 3121 


to 4464 


Unknown rnn^pr\/pH 


SeqID n°2513 








hvoothetical nrntpin 


LM-566.1 


From 4644 


to 4865 


Unknown similar to R 

i n\i iuvvi i, oil 1 llldl LU 1 / . 

subtilis YaaA protein 


SeqID n° 2514 


LM-567.1 


From 4869 


to 5981 


RecF nrotpin 


SeqID n°2515 


LM-57.1 


From 2749538 


to 2750233 


Unknown similar to twn 

\_/ 1 MM IV Wll, Ol 1 1 1 1101 WJ L VV \J 

components response 


SeqID n° 2516 








reaulator 


LM-570.1 


From 6030 


to 7970 


DNA gyrase subunit B 


SeqID n°2517 


LM-573.1 


From 8065 


to 10593 


DNA avrase subunit A 


SeqID n° 2518 


LM-575.1 


From 10728 


to 12242 


Unknown similar to 

v> 1 IWVI I, Oil 1 HIGH IU 


SeqID n°2519 








cardiolipin synthase 


LM-576.1 


From 12258 


to 12776 


Ulal J III ft? IN- 


SeqID n° 2520 








acetyltransferase 


LM-578.1 


From 12918 


to 13886 


Unknown similar tn 

Wl ll\l iv/ V V 1 1, OH 1 MICH LU 

mevalonate kinase 


SeqID n° 2521 


LM-580.1 


From 13843 


to 14814 


unknown similar to 

li\l IV-/VVJ f, Oil 1 MICH LU 

mevalonate diphosphate 


SeqID n° 2522 








decarboxylase 


LM-581.1 


From 14795 


to 15874 


Unknown, similar to 
mevalonate kinases 


SeqID n° 2523 


LM-582.1 


From 16219 


to 17325 


AA3-600 auinol oxidasp 
subunit II 


SeqID n° 2524 


LM-584.1 


From 17344 


to 19323 


AA3-600 quinol oxidase 
subunit 1 


SeqID n° 2525 


LM-585.1 


From 19311 


to 19922 


AA3-600 auinol oxida^p 
subunit III 

wUk/UJ 1 1 I III 


SeqID n° 2526 


LM-586.1 


From 19924 


to 20256 


lh ir\i \\Jvvi i, i iiyi fly oil 1 llldl LU 

quinol oxidase aa3-600 
chain IV 


SeqID n° 2527 


LM-587.1 


From 20308 


to 21426 


Unknown, similar to 
Bacillus anthracis CapA 
protein (polyglutamate 
capsule biosynthesis) 


SeqID n° 2528 


LM-589.1 


From 21652 


to 23085 


beta-glucosidase 


SeqID n° 2529 


LM-591.1 


From 23132 


to 23953 


Unknown 


SeqID n° 2530 


LM-592.1 


From 24194 


to 24934 


Unknown, similar to 
transcriptional regulator 


SeqID n° 2531 








(GntR family) 


LM-593.1 


From 24950 


to 25351 


Unknown, similar to PTS 
system, fructose-specific 


SeqID n° 2532 








HA component 


LM-594.1 


From 25351 


to 25839 


Unknown, similar to PTS 
system, fructose-specific 


SeqID n° 2533 








IIB component 


LM-596.1 


From 25862 


to 26665 


Unknown, similar to PTS 
system, fructose-specific 
IIC component 


SeqID n° 2534 


LM-597.1 


From 26640 


to 27467 


Unknown, similar to PTS 
system, mannose-specific 
II D component 
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SeqID n° 2535 


LM-598.1 


From 27495 


to 28199 


Unknown, similar to 
isomerase 


SeqID n° 2536 


LM-599.1 


From 28254 


to 28895 


Unknown, similar to E. coli 
copper homeostasis 
orotein CutC 


SeqID n° 2537 


LM-6.1 


From 2715023 


to 2716363 


Unknown 


SeqID n° 2538 


LM-600.1 


From 29100 


to 31004 


Unknown, similar to PTS 
system, beta-glucosides 
specific IIABC component 


SeqID n° 2539 


LM-601.1 


From 31088 


to 31963 


Unknown, similar to E. coli 
microcin C7 self-immunity 
protein (MccF) 


SeqID n° 2540 


LM-602.1 


From 32197 


to 32535 


Unknown 


SeqID n° 2541 


LM-604.1 


From 32571 


to 33380 


Unknown, conserved 
hypothetical protein 


SeqID n° 2542 


LM-605.1 


From 33397 


to 34452 


Unknown, transcriptional 
regulator Lacl family 


SeqID n° 2543 


LM-606.1 


From 34654 


to 35619 


Unknown, similar to xylose 
repressor 


SeqID n° 2544 


LM-608.1 


From 35616 


to 38018 


Unknown, similar to 


SeqID n° 2545 


LM-609.1 


From 38031 


to 39383 


Unknown, similar to PTS 
system, cellobiose-specific 

\\C\ rnmnnnpnt 


SeqID n° 2546 


LM-610.1 


From 39385 


to 40470 


Unknown, similar to 

VJIUUVJOul 1 III ICJ 1 I U 

phosphate 
aminotransferase (C- 
terminal domain^ 


SeqID n° 2547 


LM-611.1 


From 40705 


to 41730 


Unknown, similar to 

ornithine 
carbamoyltransferase 


SeqID n° 2548 


LM-613.1 


From 41803 


to 431 88 


Unknown, similar to amino 
acid transporter 


SeqID n° 2549 


LM-614.1 


From 43175 


to 44266 


Unknown, conserved 
hypothetical protein 


SeqID n° 2550 


LM-615.1 


From 44282 


to 45223 


carbamate kinase 


SeqID n° 2551 


LM-616.1 


From 45325 


to 46434 


Unknown, conserved 
hypothetical protein 


SeqID n° 2552 


LM-617.1 


From 46451 


to 47230 


Unknown, conserved 
hypothetical protein, 
hypothetical regulator 


SeqID n° 2553 


LM-618.1 


From 47335 


to 47994 


Unknown, similar to E. coli 
DedA protein 


SeqID n° 2554 


LM-619.2 


From 48074 


to 49306 


Unknown, similar to 
arginine deiminase 


SeqID n° 2555 


LM-62.1 


From 2750230 


to 2752920 


Unknown, similar to the two 
components sensor protein 
kdpD 


SeqID n° 2556 


LM-620.2 


From 703688 


to 703951 


Unknown 
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Seql 


D 


n° 2557 


LM-621.3 


From 


703128 


Seq I 


D 


n° 2558 


LM-622.1 


From 


703961 


Seql 


D 


n° 2559 


LM-623.1 


From 


704452 


Seql 


D 


n° 2560 


LM-624.1 


From 


705380 


Seql 


D 


n° 2561 


LM-625.1 


From 


706408 


Seq 
Seql 


D 
D 


n° 2562 
n° 2563 


LM-626.1 
LM-627.1 


From 
From 


707612 
709134 


Seq 
Seq 
Seq 


D 
D 
D 


n° 2564 
n° 2565 
n° 2566 


LM-628.1 
LM-629.1 
LM-631.1 


From 
From 
From 


709829 
711121 
711428 


Seq 


D 


n° 2567 


LM-632.1 


From 


712208 


Seq 


D 


n° 2568 


LM-633.1 


From 


712483 


Seq 


D 


n° 2569 


LM-635.1 


From 


713260 


Seq 


D 


n° 2570 


LM-636.1 


From 


714353 


Seq 


D 


n° 2571 


LM-637.1 


From 


716450 


Seq 


D 


n° 2572 


LM-639.1 


From 


717670 


Seq 


D 


n° 2573 


LM-641.1 


From 


718478 


Seq 
Seq 


D 
ID 


n° 2574 
n° 2575 


LM-642.1 
LM-643.1 


From 
From 


719291 
719653 


Seq 


ID 


n° 2576 


LM-645.1 


From 


720464 


Seq 
Seq 


ID 
ID 


n° 2577 
n° 2578 


LM-647.1 
LM-648.1 


From 
From 


721301 
721824 



to 


703691 


Unknown similar to acetvl 

1 1 1 \ l ■ \-J V ¥ 1 1 j +*J II 1 ill 1 w4 \m*>—* L j 1 

transferase 


to 


704338 


Unknown similar to 
unknown protein 


to 


705387 


Unknown, similar to ABC 
transporter (ATP-binding 
protein) 


to 


706150 


Unknown, conserved 
membrane protein 


to 


707292 


Unknown, similar to 
oxidoreductase 


to 


708220 


Unknown 


to 


709562 


Unknown, similar to 
unknown protein 


to 


710749 


Unknown 


to 


711435 


Unknown 


to 


712195 


Unknown, similar to 
flagellar biosynthesfc 
protein FliP 


to 


712480 


Unknown, similar to 
flagellar biosynthesis 
protein FliQ 


in 


713244 


1 ]n&T70\A/n cjimfl^ir to 
ui luvvi i, on i nidi wj 

flagellar biosynthetic 
protein FliR 


to 


714306 


Unknown similar to 

\m/ 1 1 1 XI 1 V-V V V 1 1 j V/ II 1 III KM 1 IV 

flagellar biosynthetic 
protein flhB 


to 


716428 


Unknown similar to 
flagella-associated protein 
flhA 


to 


717673 


Unknown, similar to 
flagellar biosynthesis 
protein FlhF 


to 


718449 


Unknown, similar to 
flagellar hook-basal body 
protein FlgG 


to 


719266 


Unknown, similar to 
chemotactic 
methyltransferase CheR 


to 


719626 


Unknown 


to 


720504 


Unknown, similar to motility 
protein (flagellar motor 
rotation) MotA 


to 


721291 


Unknown, similar to motility 
protein (flagellar motor 
rotation) MotB 


to 


721801 


Unknown 


to 


723737 


Unknown, similar to 
unknown protein 
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SeqID n° 2579 LM-649. 1 From 723750 to 724658 

SeqID n° 2580 LM-65.1 From 2752936 to 2753508 

SeqID n° 2581 LM-650.3 From 724896 to 725759 

SeqID n° 2582 LM-651 .1 From 726034 to 726393 

SeqID n° 2583 LM-652.1 From 726413 to 728269 

SeqID n° 2584 LM-653. 1 From 728282 to 728581 



SeqID n° 2585 LM-654.1 From 728600 to 729010 
SeqID n° 2586 LM-657. 1 From 729026 to 730072 
SeqID n° 2587 LM-658.1 From 730074 to 730496 



SeqID n° 2588 LM-659.1 From 730516 to 731751 
SeqID n° 2589 LM-660.1 From 732022 to 733014 

SeqID n° 2590 LM-664.1 From 733017 to 734564 



SeqID n° 2591 LM-665. 1 

SeqID n° 2592 LM-668.1 

SeqID n° 2593 LM-669.1 

SeqID n° 2594 LM-670.1 

SeqID n°2595 LM-671.1 



From 734570 
From 735996 
From 737269 
From 737743 
From 738191 



SeqID n° 2596 LM-672.1 From 739723 



SeqID n° 2597 LM-674.1 From 740610 



SeqID n° 2598 LM-675.1 From 741918 



SeqID n° 2599 LM-676. 1 From 742276 
SeqID n° 2600 LM-677.1 From 742578 



SeqID n° 2601 LM-679.1 From 742991 



SeqID n° 2602 LM-680.1 From 743418 



to 735973 
to 737162 
to 737733 
to 738171 
to 73971 1 



to 740598 
to 741899 
to 742304 



to 742557 
to 742979 



to 743401 
to 743714 



Unknown, similar to CheA 

activity-modulating 
chemotaxis protein CheV 
potassium-transporting 
atpase c chain 
flagellin protein 
Chemotaxis response 

regulator CheY 
two-component sensor 
histidine kinase CheA 
Unknown, similar to 
flagellar motor switch 
protein fliY C-terminal part 
Unknown 
Unknown 
unknown, similar to 
flagellar hook assembly 
protein 
Unknown, similar to 
flagellar hook protein FlgE 

Unknown, similar to 
flagellar switch protein FliM 

UNknown, similar to 
flagellar motor switch 
protein fliY 
Unknown 
Unknown 
Unknown 
Unknown 
Unknown, similar to 
flagellar hook-associated 
protein FlgK 
Unknown, similar to 
flagellar hook-associated 
protein 3 FlgL 
Unknown, similar to 
flagellar hook-associated 
protein 2 FliD 
Unknown, similar to 
hypothetical flagellar 
protein 
Unknown 
Unknown, similar to 
flagellar basal-body rod 
protein flgB 
Unknown, similar to 
flagellar basal-body rod 
protein flgC 
Unknown, similar to 
flagellar hook-basal body 
complex protein FliE 
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SeqID n° 2603 


LM-682.1 


From 743782 


to 745434 


SeqID n° 2604 


LM-685.1 


From 745437 


to 746543 


SeqID n° 2605 
SeqID n° 2606 


LM-686.1 
LM-688.1 


From 746530 
From 747219 


to 747222 
to 748520 


SeqID n° 2607 


LM-689.1 


From 748537 


to 749205 


SeqID n° 2608 


LM-69.1 


From 2753523 


to 2755568 


SeqID n° 2609 
SeqID n° 2610 


LM-690.1 
LM-691.1 


From 749219 
From 749985 


to 749863 
to 75031 1 


SeqID n° 2611 
SeqID n° 2612 


LM-692.1 
LM-693.1 


From 750311 
From 750680 


to 750634 
to 751327 


SeqID n° 2613 


LM-695.1 


From 751598 


to 753328 


SeqID n° 2614 


LM-697.1 


From 753490 


to 755295 


SeqID n° 2615 


LM-698.1 


From 755308 


to 756036 


SeqID n° 2616 


LM-699.2 


From 2842413 


to 2843867 


SeqID n° 2617 
SeqID n° 2618 


LM-7.1 
LM-700.1 


From 2716408 
From 2843914 


to 2717394 
to 2844216 


SeqID n° 2619 


LM-702.1 


From 2844249 


to 2845601 


SeqID n° 2620 


LM-703.1 


From 2845613 


to 2846497 


SeqID n° 2621 


LM-704.1 


From 2846490 


to 2846798 


SeqID n° 2622 


LM-705.1 


From 2846839 


to 2847573 


SeqID n° 2623 
SeqID n° 2624 
SeqID n° 2625 


LM-706.1 
LM-707.2 
LM-708.3 


From 2847673 
From 2848406 
From 2849563 


to 2848335 
to 2849536 
to 2850450 



Unknown, similar to 
flagellar basal-body M-ring 
protein fliF 
Unknown, similar to 
flagellar motor switch 
protein fliG 
Unknown 
Unknwon, similar to H+- 
transporting ATP synthase 
alpha chain Fill, flagellar- 
specific, - 
Unknown, similar to 
transglycosylase 
potassium-transporting 
atpase b chain 
Unknown 
Unknown, similar to 
unknown protein 
Unknown 
putative fibronectin-binding 
protein 
Unknown, similar to 
pyruvate oxidase 
Unknown, similar to metyl- 
accepting chemotaxis 

protein 
Unknown, similar to B. 
subtilis YvpB protein 
Unknown, similar to beta- 
glucosidase 
Unknown 
Unknown, similar to PTS 
cellobiose-specific enzyme 
MB 

Unknown, similar to PTS 
cellobiose-specific enzyme 

IIC component 
Unknown, similar to xylose 
operon regulatory protein 

and to glucose kinase 
Unknown, similar to PTS 
cellobiose-specific enzyme 
MA 

Unknown, similar to 
hypothetical transcriptional 
regulator 
Unknwon 
Unknwon 
Unknown, similar to ABC 
transporter, ATP-binding 
protein 



WO 01/77335 



PCT/FR01/01118 



164 



SeqID n° 2626 LM-709.2 From 2850620 to 2852950 



SeqID n° 2627 LM-710.1 From 2852988 to 2854436 

SeqID n° 2628 LM-71 1 .2 From 2854429 to 2856282 

SeqID n° 2629 LM-712.2 From 2856379 to 2857218 

SeqID n° 2630 LM-714.1 From 2857580 to 2858203 

SeqID n° 2631 LM-715. 1 From 2858196 to 2860364 

SeqID n° 2632 LM-716.1 From 2860426 to 2860821 

SeqID n° 2633 LM-717.1 From 2861424 to 2862614 

SeqID n° 2634 LM-718.1 From 2862774 to 2863283 

SeqID n° 2635 LM-720. 1 From 2863561 to 2864661 

SeqID n° 2636 LM-721.1 From 2864813 to 2865121 

SeqID n° 2637 LM-723. 1 From 2865127 to 2867397 

SeqID n° 2638 LM-724.1 From 2867432 to 2867731 

SeqID n° 2639 LM-725. 1 From 2867746 to 2869041 

SeqID n° 2640 LM-726.1 From 2869191 to 2871107 



SeqID n° 2641 LM-727.1 From 2871318 to 2872784 
SeqID n° 2642 LM-728.1 From 2872932 to 2873915 
SeqID n° 2643 LM-73.1 From 2755580 to 2757265 



SeqID n° 2644 LM-730.1 From 2873915 to 2875837 



SeqID n° 2645 LM-731 2. From 2875945 to 2876775 
SeqID n° 2646 LM-733.2 From 2877160 to 287801 1 

SeqID n° 2647 LM-734. 1 From 2878004 to 2878765 



Unknown, similar to 
gamma-glutamylcysteine 

synthetase (for the 
NJerminal part) and to 
cyanophycin synthetase 

(C-terminal part) 
Unknown, similar to beta- 

glucosidase 
Unknown, similar to beta- 
glucoside-specific enzyme 
IIABC 
Unknwon, similar to 
transcription antiterminator 
Unknwon, similar to ABC 
transporter, ATP-binding 
protein 
Unknown 
Unknown 
Unknown, similar to efflux 
protein 
Unknown 
Unknown, similar to 
probable GTP-binding 
protein 
Unknown, similar to 
cellobiose PTS enzyme HA 
Unknown, similar to beta- 

glucosidase 
Unknown, similar to PTS, 
cellobiose-specific MB 

component 
Unknown, similar to 

cellobiose 
phosphotransferase 
system enzyme IIC 
Unknown, similar to 
lichenan operon 
transcription antiterminator 
licR 
catalase 
Unknown 
unknown, highly similar to 
potassium-transporting 

atpase a chain 
beta-glucoside-specific 
phosphotransferase 
enzyme II 
transcription antiterminator 
Partition protein ParB 

homolg 
Partition protein, ParA 
homolog 
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SeqID n° 2648 
SeqiD n° 2649 
SeqID n° 2650 



SeqID n° 2651 
SeqID n° 2652 

SeqID n° 2653 
SeqID n° 2654 

SeqID n° 2655 
SeqID n° 2656 

SeqID n° 2657 
SeqID n° 2658 

SeqID n° 2659 

SeqID n° 2660 

SeqID n° 2661 
SeqID n° 2662 
SeqID n° 2663 

SeqID n° 2664 
SeqID n° 2665 
SeqID n° 2666 
SeqID n° 2667 
SeqID n° 2668 
SeqID n° 2669 



LM-735. 1 From 2878989 to 2879747 
LM-736. 1 From 2879906 to 2880205 
LM-737.1 From 2880217 to 2881071 



LM-739.1 From 2881243 to 2882049 
LM-74. 1 From 2757606 to 27579 1 1 

LM-740.1 From 2882033 to 2882938 
LM-741 A From 2882963 to 2883409 

LM-742. 1 From 2883422 to 2884078 

LM-744.1 From 2884141 to 2885547 

LM-747.1 From 2885581 to 2886582 

LM-748.2 From 2886584 to 2887297 

LM-749.2 From 2536347 to 2538581 

LM-750. 1 From 2538588 to 25391 96 

LM-751.1 From 2539213 to 2539611 

LM-752. 1 From 2539838 to 25401 73 

LM-753. 1 From 2540421 to 2541 857 

LM-754.1 From 2542014 to 2542610 

LM-755. 1 From 2542658 to 2544049 

LM-758. 1 From 2545487 to 2546503 

LM-759. 1 From 2546530 to 2547501 

LM-760. 1 From 2547509 to 2548477 

LM-761 . 1 From 2548479 to 2549354 



Unknown 
Unknown 
Unknown, highly similar to 

B. subtilis DNA-binding 
protein SpoOJ-like homolog 
YyaA 

Unknown, similar to E. coli 
RpiR transcription regulator 
Unknown, similar to 

cellobiose 
phosphotransferase 
enzyme IIB component 

Unknown, similar to 
transcription regulator 
Unknown, similar to 
phosphotransferase 
system mannitol-specif/c 
enzyme IIA 
Unknown, similar to 

phosphatase 
Unknown, similar to 
phosphotransferase 
system mannitol-specific 
enzyme IIBC 
Unknown, similar to 

dehydrogenase 
Unknown, similar to a 

putative N- 
acetyImannosamine-6- 
phosphate epimerase 
unknown, similar to 

transport protein 
Unknown, similar to 
transcription regulator 
Unknown 
Unknown 
Unknown, similar to 
chitinase and chitin binding 
protein 
ATP-dependent Clp 
protease proteolytic subunit 
Unknown, similar to amino 

acid transporter 
Unknwon, similar to NADH 
oxidase 
Unknown, conserved 
hypothetical protein 
unknown, conserved 
hypothetical protein 
unknown, conserved 
hypothetical protein 
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SeqID n° 2670 


LM-762.1 


From 2549462 


to 2551192 


SealD n° 2671 


I M-763 1 


From 19*34 

1 1 Ull 1 £L\JkJ 1 Z. JH 




SeqID n° 2672 
SealD n° 2673 
SeqID n° 2674 
SeqID n° 2675 


LM-764.1 
LM-765 1 
LM-766.1 
LM-767.1 


From 2552312 
From ? l S c ;'3414 

From 2554452 
From 2556013 


to 2553295 

to 2555927 
to 2556510 


SeqID n° 2676 


LM-768.1 


From 2556514 


to 2557167 


SeqID n° 2677 


LM-769.1 


From 2557211 


to 2558044 


SeqID n° 2678 


LM-77.1 


From 2758046 


to 2759353 


SeqID n° 2679 


LM-771.1 


From 2558130 


to 2559068 


SeqID n° 2680 


LM-772.1 


From 2559255 


to 2559608 


SeqID n° 2681 
SeqID n° 2682 


LM-774.1 
LM-776.1 


From 2559841 
From 2561120 


to 2561052 
to 2562382 


SeqID n° 2683 


LM-778.1 


From 2562591 


to 2565461 


SeqID n° 2684 


LM-78.1 


From 2759390 


to 2759692 


SealD n° 2685 


LM-780 1 


From P'SR^RQ 




SeqID n° 2686 
SeqID n° 2687 
SeqID n° 2688 


LM-781.1 
LM-782.1 
LM-783.1 


From 2567956 
From 2568625 
From 2569117 


to 2568603 
to 2569011 
to 2569422 


SeqID n° 2689 


LM-784.1 


From 2569690 


to 2570349 


SeqID n° 2690 


LM-786.1 


From 2570362 


to 2571141 


SeqID n° 2691 


LM-788.1 


From 2571156 


to 2571971 


SeqID n° 2692 


LM-789.1 


From 2571999 


to 2572883 



Unknown, similar to 
phosphomannomutase and 

phosphoglucomutase 
Unknown, similar to aldose 
1-epimerase (mutarotase) 
UDP-glucose 4-epimerase 
thioredoxin reductase 

Unknown 
Unknown, similar to 
acetyltransferase 
Unknown, similar to B. 
subtilis P-Ser-HPr 
phosphatase 
unknown, highly similar to 

prolipoprotein 
diacylglyceryl transferase 
Unknown, similar to 

ceilobiose 
phosphotransferase 
enzyme IIC component 
HPr-P(Ser) 
kinase/phosphatase 
Unknown, similar to B. 
subtilis YvID protein 
unknown 
Unknown, similar to B. 
subtilis YvlB protein 
excinuclease ABC (subunit 
A) 

Unknown, similar to 

ceilobiose 
phosphotransferase 
enzyme IIA component 
excinuclease ABC (subunit 
B) 

Unknown 
unknown 
Unknown, similar to 
transcription regulator ArsR 
family 
Unknown, similar to 
negative regulator of 
phosphate regulon 
unknown, similar to 
phosphate ABC transporter 
(ATP-binding protein) 
unknown, similar to 
phosphate ABC transporter 
(ATP-binding protein) 
Unknown, similar to 
phosphate ABC transporter 
(permease protein) 
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SeqID n° 2693 LM-791 A From 2572880 to 2573803 



SeqID n° 2694 LM-793. 1 From 2573897 to 2574805 



SeqID n° 2695 

SeqID n° 2696 

SeqID n° 2697 
SeqID n° 2698 



LM-794.1 

LM-795.1 

LM-796.2 
LM-797.2 



From 2575061 

From 2576836 

From 2577696 
From 2578899 



to 2576836 

to 2577546 

to 2578871 
to 2580347 



SeqID n° 2699 LM-799.2 From 2278639 to 2279292 



SeqID n° 2700 
SeqID n° 2701 
SeqID n° 2702 



LM-8.1 
LM-80.1 
LM-801.1 



From 2717422 
From 2760147 
From 2277398 



to 2718234 
to 2760680 
to 2278513 



SeqID n° 2703 LM-803.1 From 2275520 to 2277325 

SeqID n° 2704 LM-804. 1 From 2274961 to 2275221 

SeqID n° 2705 LM-805.1 From 2274127 to 2274750 

SeqID n° 2706 LM-806.1 From 2272403 to 22741 12 

SeqID n° 2707 LM-807.1 From 2271444 to 2272316 



SeqID n° 2708 LM-809.1 From 2270465 to 2271454 

SeqID n° 2709 LM-81 .1 From 2760845 to 2761954 

SeqID n° 2710 LM-810.1 From 2269705 to 2270484 

SeqID n° 271 1 LM-812.1 From 2268961 to 2269701 

SeqID n° 2712 LM-813.2 From 2268424 to 2268900 

SeqID n° 2713 LM-817.1 From 2259366 to 2259713 

SeqID n° 2714 LM-818.1 From 2258697 to 2259260 

SeqID n° 2715 LM-820.1 From 2257754 to 2258515 

SeqID n° 2716 LM-821 .1 From 2256507 to 2257580 



Unknown, similar to 
phosphate ABC transporter 
(permease protein) 
Unknown, similar to 
phosphate ABC transporter 
(binding protein) 
two-component sensor 
histidine kinase 
two-component response 
regulator 
Unknown 
Unknown, similar to 
cardiolipin synthase 
competence negative 
regulator mecA 
Unknown 
unknown 
Unknown, similar to a 
putative competence 
protein from streptococcus 
pneumoniae 
Unknown, similar to 
oligoendopeptidase 
Unknown 
Unknown 
Unknown 
Unknown, similar to 

ferrichrome ABC 
transporter (binding 

protein) 
Unknown, similar to 
ferrichrome ABC 
transporter (permease) 
Unknown, simlilar to cell 
division protein FtsW 
Unknown, similar to 
ferrichrome ABC 
transporter (ATP-binding 
protein) 
Unknown, similar to 

unknown protein 
Unknown, similar to 

unknown protein 
Unknown, similar to 

unknown protein 
Unknown, similar to 
transcriptional regulator 
(tetR family) 
Unknown, similar to 

dehydrogenase 
Unknown, similar to 
unknown proteins 
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SeqlDn°2717 LM-823.1 From 2255034 to 2256401 



SeqlDn°2718 LM-826.1 From 2253221 to 2254804 



SeqID n" 2719 

SeqID n° 2720 

SeqID n° 2721 

SeqID n° 2722 

SeqID n° 2723 

SeqID n° 2724 
SeqID n° 2725 



LM-827.1 From 2252001 to 2253224 

LM-83.1 From 2761951 to 2763081 

LM-830.1 From 2251050 to 2251979 

LM-831.1 From 2250431 to 2250820 

LM-833.1 From 2249683 to 2250300 

LM-834.1 From 2248946 to 2249617 

LM-835.1 From 2247998 to 2248693 



SeqID n° 2726 LM-836.1 From 2247050 to 2247928 



SeqID n° 2727 

SeqID n° 2728 

SeqID n° 2729 
SeqID n° 2730 



LM-837.1 

LM-838.1 

LM-839.1 
LM-840.1 



From 2245899 

From 2245143 

From 2244419 
From 2243448 



to 2246975 

to 2245883 

to 2245141 
to 2244416 



SeqID n° 2731 LM-841.1 From 2242376 to 2243425 



SeqID n° 2732 
SeqID n° 2733 
SeqID n° 2734 



LM-842.1 
LM-843.1 
LM-845.1 



From 2240068 
From 2239672 
From 2236844 



to 2241969 
to 2240013 
to 2239135 



SeqID n° 2735 LM-846.1 From 2235739 to 2236788 



SeqID n° 2736 
SeqID n° 2737 
SeqID n° 2738 
SeqID n° 2739 
SeqID n° 2740 
SeqID n° 2741 



LM-847.1 From 2235305 to 2235742 

LM-848.1 From 2234982 to 2235308 

LM-849.1 From 2234575 to 2234874 

LM-85.1 From 2763082 to 2765652 

LM-850.1 From 2233841 to 2234158 

LM-851.1 From 2233150 to 2233791 



Unknown, similar to sigma- 
54-dependent 
transcriptional activator 
Unknown, similar to 
propionate CoA- 
transferase 
Unknown, similar to 
antiporter proteins 
Unknown, simlilar to cell 
division protein FtsW 
Unknown, similar to 
unknown proteins 
Unknown, similar to 

glyoxalase I 
Unknown, similar to 
unknown proteins 

Unknown 
Unknown, similar to 
transcription regulator 
CRP/FNR family 
Unknown, similar to 
transcriptional regulator 
(AraC/XylS family) 
Unknown, similar to 

oxidoreductase 
Unknown, similar to 
unknown proteins 

Unknown 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
oxidoreductase 
Unknown 
Unknown 
Unknown, similar to 
ribonucleoside-diphosphate 
reductase, subunit alpha 
Unknown, similar to 
ribonucleoside-diphosphate 
reductase, subunit beta 
Unknown, similar to 

flavodoxin 
Unknown, similar to 

thioredoxin 
Unknown, similar to 
unknown proteins 
Unknown, highly similar to 
Mg2+ transport ATPase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
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SeqID n° 2742 LM-853. 1 From 2232087 to 2233094 

SeqID n° 2743 LM-854.1 From 2231127 to 2231981 

SeqID n° 2744 LM-856.1 From 2230042 to 2231058 

SeqID n° 2745 LM-857.1 From 2229193 to 2229927 

SeqID n° 2746 LM-858.1 From 2227412 to 2229154 



SeqID n° 2747 LM-859. 1 From 2226533 to 2227198 
SeqID n° 2748 LM-860.1 From 2225471 to 2225944 

SeqID n° 2749 LM-861.1 From 2224219 to 2225454 



SeqID n° 2750 LM-862.1 From 2223324 to 2224226 

SeqID n° 2751 LM-863.1 From 2221671 to 2223128 

SeqID n° 2752 LM-865.1 From 2221186 to 2221659 

SeqID n° 2753 LM-866.1 From 2220862 to 2221 173 

SeqID n° 2754 LM-867.1 From 2219751 to 2220845 

SeqID n° 2755 LM-869.1 From 2218879 to 2219733 

SeqID n° 2756 LM-87. 1 From 2766207 to 2766773 

SeqID n° 2757 LM-870.1 From 2217963 to 2218862 



SeqID n° 2758 LM-871.1 From 2217222 to 2217953 
SeqID n° 2759 LM-872.1 From 2216054 to 2216743 
SeqID n° 2760 LM-874.2 From 2486136 to 2486921 



SeqID n° 2761 LM-875.1 From 2484816 to 2486117 
SeqID n° 2762 LM-876.1 From 2483589 to 2484815 



Unknown, similar to 
unknown proteins 
Unknown, similar to 
transcription regulator LysR 
family 
Unknown, similar to 

unknown protein 
Unknown, similar to 
transcription regulator 
GntR family 
Unknown, weakly similar to 
mannose-6-phosphate 
isomerase 
Unknown 
Unknown, similar to 
unknown protein 
Unknown, similar to ABC 
transporter (membrane 

protein) 
Unknown, similar to ABC 
transporter (ATP-binding 
protein) 
Unknown, similar to 
transcription regulator 
Unknown, similar to PTS 
system, fructose-specific 
enzyme IIA component 
Unknown, similar to PTS 
system, fructose-specific 
enzyme IIB component 
Unknown, similar to PTS 
system, fructose-specific 
enzyme IIC component 

Unknown, similar to 
fructose~1 ,6-biphosphate 
aldolase type II 
Unknown, similar to 
transcription regulator, 
TetR family 
Unknown, similar to 
fructose- 1,6-biphosphate 
aldolase type II 
Unknown 
Unknown 
Unknown, similar to ABC 
transporter, ATP-binding 
protein 
Unknown, similar to 
aminotransferase 
Unknown, similar to 
aminotransferase 
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SeqID n° 2763 LM-877.1 From 2483149 to 2483592 



SeqID n° 2764 LM-878.1 From 2481732 to 2483126 



SeqID n° 2765 
SeqID n° 2766 
SeqID n° 2767 
SeqID n° 2768 

SeqID n° 2769 
SeqID n° 2770 



LM-879.1 
LM-880.1 
LM-881.1 
LM-883.1 

LM-884.1 
LM-885.1 



From 2481023 
From 2480531 
From 2479607 
From 2478623 

From 2478225 
From 2477385 



to 2481574 
to 2480944 
to 2479921 
to 2479465 

to 2478590 
to 2478224 



SeqID n° 2771 LM-886. 1 From 2475860 to 2477251 

SeqID n° 2772 LM-887.1 From 2475588 to 2475863 

SeqID n° 2773 LM-888.1 From 2474801 to 2475568 

SeqID n° 2774 LM-889. 1 From 2474309 to 2474752 

SeqID n° 2775 LM-89. 1 From 2766935 to 2768707 

SeqID n° 2776 LM-890. 1 From 2472963 to 2474273 

SeqID n° 2777 LM-892.1 From 2472438 to 2472938 

SeqID n° 2778 LM-893. 1 From 2472089 to 2472325 

SeqID n° 2779 LM-896.1 From 2468210 to 2468392 

SeqID n° 2780 LM-897. 1 From 2467643 to 2467969 

SeqID n° 2781 LM-899.1 From 2466713 to 2467342 



SeqID n° 2782 LM-90.1 From 2768944 to 2769273 
SeqID n° 2783 LM-901 A From 2465630 to 2466625 



SeqID n° 2784 LM-903.1 From 2463969 to 2465180 
SeqID n° 2785 LM-904.1 From 2463066 to 2463884 



Unknown, similar to 
conserved hypothetical 
proteins 
Unknown, similar to 
conserved hypothetical 
proteins 
Unknown 
Unknown 
unknown 
unknown, similar to B. 
subtilis YunF protein 

Unknown 
Unknown, similar to 
conserved hypothetical 

proteins 
Unknown, similar to B. 
subtilis YunD protein 
Unknown, similar to B. 
subtilis YutD protein 
Unknown, similar to 
conserved hypothetical 
protein and to B. subtilis 
YutF protein 
Unknown, similar to 
acetyltransferase 
Unknown, similar to 

autolysin, N- 
acetylmuramidase 
Unknown, similar to 
conserved hypothetical 
proteins 
Unknown, low temperature 
requirement C protein, also 
similar to B. subtilis YutG 

protein 
Unknown, similar to NifU 

protein 
Unknown, hypothetical 
CDS 

Unknown, similar to B. 
subtilis YuzD protein 
Unknown, conserved 
hypothetical protein similar 
to B. subtilis YhfK protein 
Unknown 
Unknown, similar to 
hypothetical thioredoxine 

reductase 
Unknown, similar to NADH 
dehydrogenase 
Unknown, similar to B. 
subtilis YwqG protein 
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SeqID n° 2786 


LM-906.1 


From 2461803 


to 2463029 


o^#iir"\ n o 1707 


LM-907. 1 


From 2461219 


to 2461692 


SeqID n° 2788 


LM-908.1 


From 2460720 


to 2461091 


SeqID n° 2789 


LM-91.1 


From 2769381 


to 2770007 


SeqID n° 2790 


LM-910.1 


From 2460090 


to 2460683 


SeqID n° 2791 


LM-911.1 


From 2459819 


to 2460106 


SeqID n° 2792 


LM-912.1 


From 2459343 


to 2459822 


SeqID n° 2793 


LM-913.1 


From 2457852 


to 2459336 


SeqID n° 2794 


LM-915.1 


From 2457512 


to 2457859 



SeqID n° 2795 LM-916.1 From 2457087 to 2457512 



SeqID n° 2796 LM-917.1 From 2454695 to 2457103 



SeqID n° 2797 

SeqID n° 2798 

SeqID n° 2799 
SeqID n°2800 



LM-918.1 From 2453138 to 2454349 

LM-91 9. 1 From 2452322 to 2452906 

LM-92.1 From 2770060 to 2771439 

LM-920. 1 From 2451840 to 2452235 



Unknown, conserved 
hypothetical protein 
Unknown, similar to B. 

subtilis YuiD protein 
Unknown, similar to B. 
subtilis YuxO protein 
Unknown, similar to 
thymidylate kinase 
unknown, similar to 
proteins involved in 
resistance to cholate and to 
NA(+) and in pH 
homeostasis 
unknown, similar to 
proteins involved in 
resistance to cholate and to 
NA(+) and in pH 
homeostasis 
unknown, similar to 
proteins involved in 
resistance to cholate and to 
NA(+) and in pH 
homeostasis 
unknown, similar to 
proteins involved in 
resistance to cholate and to 
NA(+) and in pH 
homeostasis 
unknown, similar to 
proteins involved in 
resistance to cholate and to 
NA(+) and in pH 
homeostasis 
unknown, similar to 
proteins involved in 
resistance to cholate and to 
NA(+) and in pH 
homeostasis 
unknown, similar to 
proteins involved in 
resistance to cholate and to 
NA(+) and in pH 
homeostasis 
Unknown, similar to multi- 
drug resistance efflux 

pump 
Unknown, similar to 
peptidyl-prolyl cis-trans 
isomerase 
Unknown, similar to lysine 
decarboxylase 
Unknown 
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SeqID n° 2801 LM-922.1 From 2450437 to 2451798 

SeqiD n° 2802 LM-923. 1 From 2449878 to 24501 92 

SeqID n° 2803 LM-924.1 From 2449155 to 2449835 

SeqID n° 2804 LM-925. 1 From 2448052 to 24491 55 

SeqID n° 2805 LM-926. 1 From 2446747 to 244791 3 

SeqID n° 2806 LM-927. 1 From 2446240 to 2446593 



SeqID n° 2807 LM-929.1 From 2445685 to 2446092 
SeqID n° 2808 LM-930.2 From 2444231 to 2445583 

SeqID n° 2809 LM-931.2 From 2443368 to 2444126 



SeqID n° 2810 LM-935.1 From 198201 to 199409 



SeqID n° 2811 LM-936.1 From 197512 to 198204 

SeqID n° 2812 LM-937.1 From 196786 to 197463 
SeqID n° 2813 LM-938.1 From 195793 to 196611 



SeqID n° 2814 LM-939.1 From 194892 to 195629 

SeqID n° 2815 LM-94.1 From 2771623 to 2772612 

SeqID n° 2816 LM-941.1 From 193989 to 194870 

SeqID n° 2817 LM-942.1 From 193593 to 193850 

SeqID n° 2818 LM-943.1 From 192586 to 193473 

SeqID n° 2819 LM-945.1 From 192018 to 192593 

SeqID n° 2820 LM-946.1 From 190690 to 191916 



Unknown, similar to 

aspartate kinase 
Unknown, similar to 
phosphotransferase 
system (PTS) beta- 
glucoside-specific enzyme 

IIB component 
unknown, similar to ABC- 
transporter ATP binding 
proteins 
Unknown, similar to 
putative ABC-transporter 
transmembrane subunit 
Unknown, similar to 
aminotransferase 
Unknown, similar to B. 
subtilis general stress 
protein 13 containing a 
ribosomal S1 protein 
domain 
Unknown 
glucose-6-phosphate 

isomerase 
Unknown, similar to 
transcription regulator 
DeoR family 
Unknown, similar to ABC 
transporter, ATP-binding 

protein 
ABC transporter, ATP- 
binding protein 
unknwon 
Unknown, similar to PurR, 
transcription repressor of 
purine operon of B. subtilis 
Unknwon, similar to a 
putative phospho-beta- 
glucosidase 
Unknown, similar to 
dihydroxyacetone kinase 
Unknwon, similar to B. 
subtilis YabH protein 
Unknwon, highly similar to 
B. subtilis Veg protein 

dimethyladenosine 
transferase (16S rRNA 

dimethylase) 
Unkwnon, similar to B. 
subtilis YabF protein 
Unknwon, similar to B. 
subtilis YabE protein 
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SeqID n° 2821 LM-947.1 From 189625 to 190398 



SeqID n° 2822 
SeqID n° 2823 

SeqID n° 2824 
SeqID n° 2825 



LM-949.1 From 187867 to 189528 

LM-95.1 From 2772634 to 2773230 

LM-950.1 From 185572 to 187863 

LM-955.1 From 182267 to 185569 



SeqID n° 2826 LM-958.1 From 180928 to 182184 



SeqID n° 2827 LM-96.1 From 2773234 to 2773608 



SeqID n° 2828 


LM-960.1 


From 


180052 


to 


180900 


SeqID n° 2829 


LM-961.1 


From 


179174 


to 


180052 


SeqID n° 2830 


LM-962.1 


From 


177924 


to 


179138 


SeqID n° 2831 


LM-964.1 


From 


175766 


to 


177760 


SeqID n° 2832 


LM-965.1 


From 


174834 


to 


175694 


SeqID n° 2833 


LM-968.1 


From 


172903 


to 


173208 


SeqID n° 2834 


LM-969.1 


From 


172410 


to 


172802 



SeqID n° 2835 

SeqID n° 2836 

SeqID n° 2837 
SeqID n° 2838 

SeqID n° 2839 
SeqID n° 2840 
SeqID n° 2841 



LM-97. 1 From 2773714 to 2774565 

LM-970. 1 From 1 72072 to 1 7241 0 

LM-975. 1 From 1 68005 to 1 69270 

LM-977. 1 From 1 67086 to 1 67943 

LM-978.1 From 166687 to 166971 

LM-98. 1 From 2774684 to 2775523 

LM-981 .1 From 165761 to 166642 



Unknwon, similar to 
conserved hypothetical 
proteins 
Unknwon, similar to oligo- 
1,6-gIucosidase 
Unknown, similar to 
hypothetical 
dihydroxyacetone kinase 
Unknwon, similar to alpha- 

glucosidase 
Unknwon, similar to alpha- 
xylosidase and alpha- 
glucosidase 
Unknwon, similar to sugar 

ABC transporter, 
periplasmic sugar-binding 
protein 
unknown 
Unknwon, similar to sugar 
ABC transporter, permease 

protein 
Unknwon, similar to sugar 
ABC transporters, 
permease proteins 
Unknwon, similar to xylose 

repressor 
methionyl-tRNA synthetase 
Unknwon, similar to 
glucose uptake protein 
Unknown, similar to 

transposase 
Unknown, similar to 
transposase (N-terminal 
part) 

Unknown, similar to 
putative transcription 

regulator 
Unknown, similar to 
transposase C-terminal 
part 
Unknwon 
Unknown, similar to a 
glucose uptake protein 
Unknown, similar to B. 

subtiiis transcription 
regulatory protein AbrB 

Unknown, similar to 
conserved hypothetical 

protein 
Unknwon, conserved 
hypothetical protein 
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9849 


I M-QR9 1 


n ui 1 1 




SeqID n° 


2843 


LM-983.1 


From 


164756 


SeqID n° 


2844 


LM-984.1 


From 


164309 


SeqID n° 
SeqID n° 


2845 
2846 


LM-987.1 
LM-988.2 


From 
From 


163465 
162467 


SeqID n° 


2847 


LM-990.2 


From 


596580 


SealD n° 


2848 


LM-991 1 


From 




SeqID n° 


2849 


LM-992.1 


From 


595135 


SeqID n° 


2850 


LM-994.1 


From 


593528 


SeqID n° 


2851 


LM-995.1 


From 


592257 


SeqID n° 
SeqID n° 


2852 
2853 


LM-996.1 
LM-997.1 


From 
From 


591412 
590324 


SeqID n° 


2854 


LM-998.1 


From 


589406 



to 165764 unknwon, similar to B. 

subtilis YazA protein 
to 165508 unknwon, conserved 
hypothetical protein 
to 164698 Unknwon, similar to B. 

subtilis YabA protein 
to 164298 Unknwon 
to 163459 Unknwon, similar to B. 

subtilis DNA polymerase III 
(delta subunit) 
to 597620 Unknown, conserved 
hypothetical protein 
to 596538 Unknown, similar to 

phosphoglycerate mutase 
to 595842 Unknown, similar to 

phosphoglycerate mutase 
to 595006 Unknown, similar to di- 
tripeptide ABC transporter 
(membrane protein) 
to 593438 Unknown, similar to NADH- 
dependent butanol 
dehydrogenase 
to 592047 Unknown 
to 591355 Unknown, similar to 

unknown protein 
to 590248 Unknown 
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REVENDICATIONS 

1. Sequence nucleotidique de Listeria monocytogenes caracterisee en ce qu'elle 
correspond a SEQ ID N° 1 . 
5 2. Sequence nucleotidique de Listeria monocytogenes, caracterisee en ce qu'elle 
est choisie parmi : 

a) une sequence nucleotidique comportant au moins 80 % d'identite avec 
SEQ ID N° 1 ; 

b) une sequence nucleotidique hybridant dans des conditions de forte 
1 0 stringence avec SEQ ID N° 1 ; 

c) une sequence nucleotidique complementaire de SEQ ID 
N° 1 ou complementaire d'une sequence nucleotidique telle que definie 
en a), ou b\ ou une sequence nucleotidique de TARN correspondant ; 

d) une sequence nucleotidique de fragment representatif de SEQ ID N° 1, 
15 ou de fragment representatif d'une sequence nucleotidique telle que 

definie en a) ? b) ou c); 

e) une sequence nucleotidique comprenant une sequence telle que definie 
en a), b), c) ou d) ; et 

f) une sequence nucleotidique modifiee d'une sequence nucleotidique telle 
20 que definie en a), b), c), d) ou e). 

3. Sequence nucleotidique selon la revendication 2, caracterisee en ce qu'il s'agit 
d'une sequence issue de SEQ ID N° 1, et en ce qu'elle code pour un polypeptide 
choisi parmi les polypeptides de sequence SEQ ID N° 1 a SEQ ID N° 2854. 

4. Sequence nucleotidique caracterisee en ce qu'elle comprend une sequence 
25 nucleotidique choisie parmi : 

a) une sequence nucleotidique selon la revendication 3 ; 

b) une sequence nucleotidique comportant au moins 80 % d'identite avec 
une sequence nucleotidique selon la revendication 3 ; 

c) une sequence nucleotidique s'hybridant dans des conditions de forte 
30 stringence avec une sequence nucleotidique selon la revendication 3 ; 

d) une sequence nucleotidique complementaire ou d'AKN correspondant a 
une sequence telle que definie en a), b) ou c) ; 
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e) une sequence nucleotidique de fragment representatif d'une sequence 
telle que definie en a), b), c) ou d) ; et 

f) une sequence nucleotidique modifiee d'une sequence telle que definie en 
a), b), c), d) ou e). 

5 5. Polypeptide code par une sequence nucleotidique selon Tune des revendi cations 
2 a 4. 

6. Polypeptide selon la revendication 5, caracterise en ce qu'il est choisi parmi les 
sequences SEQ ID N° 2 a SEQ ID N° 2854. 

7. Polypeptide caracterise en ce qu'il comprend un polypeptide choisi parmi : 
10 a) un polypeptide selon Tune des revendications 5 et 6 ; 

b) un polypeptide presentant au moins 80 % d'identite avec un polypeptide 
selon Tune des revendications 5 et 6 ; 

c) un fragment d'au moins 5 acides amines d'un polypeptide selon Tune 
des revendications 5 et 6, ou tel que defini en b) ; 

15 d) un fragment biologiquement actif d'un polypeptide selon Tune des 

revendications 5 et 6, ou tel que defini en b) ou c) ; et 
e) un polypeptide modifie d'un polypeptide selon Tune des revendications 
5 et 6 ou tel que defini en b) 3 c) ou d). 

8. Sequence nucleotidique codant pour un polypeptide selon la revendication 7. 
20 9. Sequence nucleotidique selon Tune des revendications 2 a 4 7 et 8, caracterisee 

en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 
dans la biosynthese des acides amines ou Tun de ses fragments. 

10. Sequence nucleotidique selon Tune des revendications 2 a 4, et 8, caracterisee 
en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 

25 dans la biosynthese des cofacteurs, groupes prosthetiques et transporters ou 
Tun de ses fragments. 

11. Sequence nucleotidique selon Tune des revendications 2 a 4, et 8, caracterisee 
en ce qu'elle code pour un polypeptide d'enveloppe cellulaire ou situe a la 
surface de Listeria monocytogenes ou Tun de ses fragments, ledit polypeptide 

30 etant de preference SEQ ID N° 2 a SEQ ID N° 41 . 

12. Sequence nucleotidique selon Tune des revendications 2 a 4, et 8, caracterisee 
en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 
dans la machinerie cellulaire ou 1'un de ses fragments. 
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13. Sequence nucleotidique selon Tune des revendications 2 a 4, et 8, caracterisee 
en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 
dans le metabolisme intermediate central ou Tun de ses fragments. 

14. Sequence nucleotidique selon Tune des revendications 2 a 4, et 8, caracterisee 
5 en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 

dans le metabolisme energenique ou Tun de ses fragments. 

15. Sequence nucleotidique selon Tune des revendications 2 a 4, et 8, caracterisee 
en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 
dans le metabolisme des acides gras et des phospholipides ou Tun de ses 

1 0 fragments. 

16. Sequence nucleotidique selon Tune des revendications 2 a 4, et 8, caracterisee 
en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 
dans le metabolisme des nucleotides, des purines, des pyrimidines ou 
nucleosides ou Tun de ses fragments. 

15 17. Sequence nucleotidique selon Tune des revendications 2 a 4, et 8, caracterisee 
en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 
dans les fonctions de regulation ou Tun de ses fragments. 

18. Sequence nucleotidique selon Tune des revendications 2 a 4, et 8, caracterisee 
en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 

20 dans le processus de replication ou Tun de ses fragment. 

19. Sequence nucleotidique selon Tune des revendications 2 a 4, et 8, caracterisee 
en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 
dans le processus de transcription ou l'un de ses fragments. 

20. Sequence nucleotidique selon 1'une des revendications 2 a 4, et 8, caracterisee 
25 en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 

dans le processus de traduction ou l'un de ses fragments. 

21. Sequence nucleotidique selon l'une des revendications 2 a 4, et 8, caracterisee 
en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 
dans le processus de transport et de liaison des proteines ou l'un de ses 

30 fragments. 

22. Sequence nucleotidique selon l'une des revendications 2 a 4, et 8, caracterisee 
en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 
dans 1'adaptation aux conditions atypiques ou l'un de ses fragments. 
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23. Sequence nucleotidique selon l'une des revendications 2 a 4, et 8, caracterisee 
en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 
dans la sensibilite aux medicaments et analogues ou Tun de ses fragments. 

24. Sequence nucleotidique caracterisee en ce qu'elle code pour un polypeptide de 
5 Listeria monocytogenes implique dans la biosynthese de la vitamine B12, ledit 

polypeptide etant de preference SEQ ID N° 42 a SEQ ID N°64. 

25. Sequence nucleotidique selon Tune des revendications 2 a 4, et 8, caracterisee 
en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 
dans les fonctions relatives aux transposons ou l'un de ses fragments. 

10 26. Sequence nucleotidique selon Tune des revendications 2 a 4, et 8, caracterisee 
en ce qu'elle code pour un polypeptide specifique de Listeria monocytogenes ou 
l'un de ses fragments. 

27. Polypeptide selon l'une des revendications 5 a 7, caracterise en ce qu'il s'agit 
d'un polypeptide de Listeria monocytogenes implique dans la biosynthese des 

1 5 acides amines ou l'un de ses fragments. 

28. Polypeptide selon l'une des revendications 5a7, caracterise en ce qu'il s'agit 
d'un polypeptide de Listeria monocytogenes implique dans la biosynthese des 
cofacteurs, groupes prosthetiques et transporters ou l'un de ses fragment. 

29. Polypeptide selon l'une des revendications 5 a 7, caracterise en ce qu'il s'agit 
20 d'un polypeptide d'enveloppe cellulaire ou situe a la surface de Listeria 

monocytogenes ou l'un de ses fragments, ledit polypeptide etant de preference 
SEQ ID N° 2 a SEQ ID N° 41. 

30. Polypeptide selon Tune des revendications 5 a 7, caracterise en ce qu'il s'agit 
d'un polypeptide de Listeria monocytogenes implique dans la machinerie 

25 cellulaire ou l'un de ses fragments. 

31. Polypeptide selon l'une des revendications 5a7, caracterise en ce qu'il s'agit 
d'un polypeptide de Listeria monocytogenes implique dans le metabolisme 
intermediate central ou Tun de ses fragements. 

32. Polypeptide selon l'une des revendications 5 a 7, caracterise en ce qu'il s'agit 
30 d'un polypeptide de Listeria monocytogenes implique dans le metabolisme 

energetique ou l'un de ses fragments. 
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33. Polypeptide selon Tune des revendications 5 a 7, caracterise en ce qu'il s'agit 
d'un polypeptide de Listeria monocytogenes implique dans le metabolisme des 
acides gras et des phospholipides ou Tun de ses fragments. 

34. Polypeptide selon Tune des revendications 5 a 7, caracterise en ce qu'il s'agit 
5 d'un polypeptide de Listeria monocytogenes implique dans le metabolisme des 

nucleotides, des purines, des pyrimidines ou nucleosides ou Tun de ses 
fragments. 

35. Polypeptide selon Tune des revendications 5 a 7, caracterise en ce qu'il s'agit 
d'un polypeptide de Listeria monocytogenes implique dans les fonctions de 

1 0 regulation ou Tun de ses fragments. 

36. Polypeptide selon l'une des revendications 5 a 7, caracterise en ce qu'il s'agit 
d'un polypeptide de Listeria monocytogenes implique dans le processus de 
replication ou l'un de ses fragments. 

37. Polypeptide selon l'une des revendications 5 a 7, caracterise en ce qu'il s'agit 
15 d'un polypeptide de Listeria monocytogenes implique dans le processus de 

transcription ou l'un de ses fragments. 

38. Polypeptide selon l'une des revendications 5 a 7, caracterise en ce qu'il s'agit 
d'un polypeptide de Listeria monocytogenes implique dans le processus de 
traduction ou Tun de ses fragments. 

20 39. Polypeptide selon l'une des revendications 5 a 7, caracterise en ce qu'il s'agit 
d'un polypeptide de Listeria monocytogenes implique dans le processus de 
transport et de liaison des proteines ou l'un de ses fragments. 

40. Polypeptide selon Tune des revendications 5 a 7, caracterise en ce qu'il s'agit 
d'un polypeptide de Listeria monocytogenes implique dans 1 'adaptation aux 

25 conditions atypiques ou l'un de ses fragments. 

41. Polypeptide selon l'une des revendications 5 a 7, caracterise en ce qu'il s'agit 
d'un polypeptide de Listeria monocytogenes implique dans la sensibilite aux 
medicaments et analogues ou l'un de ses fragments. 

42. Polypeptide selon Tune des revendications 5 a 7, caracterise en ce qu'il s'agit 
30 d'un polypeptide de Listeria monocytogenes implique dans la biosynthese de la 

vitamine B12, ledit polypeptide etant de preference SEQ ID N° 42 a SEQ ID 
N°64. 
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43. Polypeptide selon Tune des revendications 5 a 7, caracterise en ce qu'il s'agit 
d'un polypeptide de Listeria monocytogenes implique dans les fonctions 
relatives aux transposons ou Tun de ses fragments. 

44. Polypeptide selon Tune des revendications 5 a 7, caracterise en ce qu'il s'agit 
5 d'un polypeptide specifique de Listeria monocytogenes ou l'un de ses 

fragments. 

45. Sequence nucleotidique selon 1'une des revendications 1 a 4, et 8 a 26 , et/ou 
sequence de polypeptide selon l'une des revendications 5 a 7, et, 27 a 44, 
caracterisee(s) en ce que ladite ou lesdites sequences est (sont) enregistree(s) sur 

10 un support d'enregistrement dont la forme et la nature facilitent la lecture, 
F analyse et/ou F exploitation de ladite ou desdites sequences. 

46. Sequence nucleotidique ou sequence de polypeptide selon la revendication 45, 
caracterisee en ce que le support d'enregistrement est un CD-ROM, une 
disquette informatique ou un serveur informatique, 

15 47. Support d'enregistrement caracterise en ce qu'y est enregistre une sequence 

nucleotidique ou de polypeptide selon la revendication 45 ou 46. 
48. Utilisation d'un support selon la revendication 47 pour le choix d' amorces ou de 

sondes nucleotidiques pour la determination de genes dans des souches proches 

de Listeria monocytogenes. 
20 49. Utilisation d'un support selon la revendication 47 pour l'etude du 

polymorphisme genetique de souches proches de Listeria monocytogenes. 
50. Utilisation d'un support selon la revendication 47, pour l'etude d'autres 

genomes en particulier pour F annotation automatique de genes provenant 

d'autres genomes. 

25 51. Sequence nucleotidique utilisable comme amorce ou comme sonde, caracterisee 

en ce que ladite sequence est choisie parmi les sequences nucleotidiques selon 

Fune des revendications 2 a 4, et, 8 a 26. 
52. Sequence nucleotidique selon la revendication 51, caracterisee en ce qu'elle est 

marquee par un compose radioactif ou par un compose non radioactif 
30 53. Sequence nucleotidique selon Fune des revendications 51 et 52, caracterisee en 

ce qu'elle est immobilisee sur un support, de maniere covalente ou non- 

covalente. 



WO 01/77335 



PCT/FR01/01118 



181 

54. Sequence nucleotidique selon la revendication 53, caracterisee en ce qu'elle est 
immobilisee sur un support tel qu'un filtre a haute densite ou une puce a ADN. 

55. Sequence nucleotidique selon Tune des revendications 52 a 55 pour la detection 
et/ou F amplification de sequences nucleiques. 

5 56. Puce a ADN ou filtre, caracterise en ce qu'il contient au moins une sequence 
nucleotidique selon la revendication 55. 
57. Puce a ADN ou filtre selon la revendication 56, caracterise en ce qu'il contient 
en outre au moins une sequence nucleotidique d'un micro-organisme autre que 
Listeria monocytogenes, immobilisee sur le support de ladite puce. 
10 58. Puce a ADN ou filtre selon la revendication 57, caracterise en ce que le micro- 
organisme autre est choisi parmi un micro-organisme associe a Listeria 
monocytogenes, une bacterie du genre Listeria, et un variant de Listeria 
monocytogenes. 

59. Kit ou necessaire pour la detection et/ou Identification de bacteries appartenant 
15 a Fespece Listeria monocytogenes ou a un micro-organisme associe, caracterise 

en ce qu'il comprend une puce a ADN ou un filtre selon la revendication 56. 

60. Kit ou necessaire pour la detection et/ou F identification d'un micro-organisme, 
caracterise en ce qu'il comprend une puce a ADN ou un filtre selon Tune des 
revendications 57 et 58. 

20 61. Kit ou necessaire pour la detection et/ou la quantification de Fexpression d'au 
moins un gene de Listeria monocytogenes, caracterise en ce qu'il comprend une 
puce a ADN ou un filtre selon Tune des revendications 56 a 58. 
62. Vecteur de clonage, et/ou d'expression, caracterise en ce qu'il contient une 
sequence nucleotidique selon Fune des revendications 2 a 4 et 8 a 26. 

25 63. Vecteur de clonage, et/ou d'expression selon la revendication 62, caracterise en 
ce qu'il contient une sequence nucleotidique selon la revendication 11. 

64. Vecteur de clonage, et/ou d'expression selon la revendication 62, caracterise en 
ce qu'il contient une sequence nucleotidique selon la revendication 14. 

65. Vecteur de clonage, et/ou d'expression selon la revendication 62, caracterise en 
30 ce qu'il contient une sequence nucleotidique selon la revendication 9, 19 ou 20. 

66. Vecteur de clonage, et/ou d'expression selon la revendication 62, caracterise en 
ce qu'il contient une sequence nucleotidique selon la revendication 12, en 
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particulier codant pour une proteine impliquee dans les mecanismes de 
secretion. 

67. Vecteur de clonage, et/ou d' expression selon la revendication 62, caracterise en 

ce qu'il contient une sequence nucleotidique selon la revendication 24. 
5 68. Vecteur de clonage, et/ou d' expression selon la revendication 62, caracterise en 

ce qu'il contient une sequence nucleotidique codant pour une proteine impliquee 

dans la resistance et/ou P adaptation au stress. 
69. Cellule hote, caracterisee en ce qu'elle est transformee par un vecteur selon 

Tune des revendications 62 a 68. 
10 70. Cellule hote selon la revendication 69, caracterisee en ce qu'il s'agit d'une 

bacterie appartenant au genre Listeria. 
7L Cellule hote selon la revendication 70, caracterisee en ce qu'il s'agit d'une 

bacterie appartenant a Fespece Listeria monocytogenes. 

72. Vegetal ou animal, excepte PHomme, comprenant une cellule transformee selon 
1 5 Tune des revendications 69 a 71 . 

73. Precede de preparation d'un polypeptide, caracterise en ce que Ton cultive une 
cellule transformee par un vecteur selon la revendication 62 dans des conditions 
permettant 1' expression dudit polypeptide et que Ton recupere ledit polypeptide 
recombinant. 

20 74. Polypeptide recombinant susceptible d'etre obtenu par un procede selon la 
revendication 73. 

75. Procede de preparation d'un polypeptide synthetique selon 1'une des 
revendications 5 a 7, 27 a 44, caracterise en ce que 1'on effectue une synthese 
chimique dudit polypeptide. 
25 76. Polypeptide hybride, caracterise en ce qu'il comprend au moins la sequence 
d'un polypeptide selon l'une des revendications 5 a 7, 27 a 44 et 74, et une 
sequence d'un polypeptide susceptible d'induire une reponse immunitaire chez 
l'homme ou F animal. 

77. Sequence nucleotidique codant pour un polypeptide hybride selon la 
30 revendication 76. 

78. Vecteur caracterise en ce qu'il contient une sequence nucleotidique selon la 
revendication 77. 
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79. Anticorps monoclonal ou polyclonal, ses fragments, ou anticorps chimerique, 
caracterise en ce qu'il est capable de reconnaitre specifiquement un polypeptide 
selon Tune des revendications 5 a 7, 27 a 44, 74 ou 76. 

80. Anticorps selon la revendication 79, caracterise en ce qu'il s'agit d'un anticorps 
5 marque. 

81. Procede pour la detection et/ou {'identification de bacteries appartenant a 
1'espece Listeria monocytogenes ou a un microorganisme associe dans un 
echantillon biologique, caracterise en ce qu'il comprend les etapes suivantes : 

a) mise en contact de T echantillon biologique avec un anticorps selon Tune 
1 0 des revendications 79 et 80; 

b) mise en evidence du complexe antigene-anticorps eventuellement forme. 

82. Procede pour la detection de 1' expression d'un gene de Listeria monocytogenes 
caracterise en ce que Ton met en contact une souche de Listeria monocytogenes, 
avec un anticorps selon la revendication 79 ou 80 et que Ton detecte le 

1 5 complexe antigene/anticorps eventuellement forme. 

83. Kit ou necessaire pour la mise en oeuvre d'un procede selon la revendication 81 
ou 82, caracterise en ce qu'il comprend les elements suivants : 

a) un anticorps selon l'une des revendications 79 et 80; 

b) eventuellement, les reactifs pour la constitution du milieu propice a la 
20 reaction immunologique ; 

c) eventuellement, les reactifs permettant la mise en evidence des 
complexes antigene-anticorps produits par la reaction immunologique. 

84. Polypeptide selon l'une des revendications 5 a 7, 27 a 44, 74 et 76, ou anticorps 
selon l'une des revendications 79 et 80, caracterise en ce qu'il est immobilise 

25 sur un support, notamment une puce a proteine. 

85. Puce a proteine, caracterisee en ce qu'elle contient au moins un polypeptide 
selon l'une des revendications 5 a 7, 27 a 44, 74 et 76, ou au moins un anticorps 
selon l'une des revendications 79 et 80, immobilise sur le support de ladite 
puce. 

30 86. Puce a proteine selon la revendication 85, caracterisee en ce qu'elle contient en 
outre au moins un polypeptide de micro-organisme autre que Listeria 
monocytogenes ou au moins un anticorps dirige contre un compose de micro- 
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organisme autre que Listeria monocytogenes, immobilise sur le support de ladite 
puce. 

87. Kit ou necessaire pour la detection et/ou Y identification de bacteries appartenant 
a l'espece Listeria monocytogenes ou a un micro-organisme associe, caracterise 

5 en ce qu'il comprend une puce a proteine selon Tune des revendications 85 et 
86. 

88. Kit ou necessaire pour la detection et/ou Identification d'un micro-organisme, 
caracterise en ce qu'il comprend une puce a proteine selon la revendication 86. 

89. Procede de detection et/ou d'identification de bacteries appartenant a l'espece 
1 0 Listeria monocytogenes ou a un micro-organisme associe dans un echantillon 

biologique, caracterise en ce qu'il met en oeuvre une sequence nucleotidique 
selon Tune des revendications 1 a 4, 8 a 26, 51 a 55 et 77. 

90. Procede selon la revendication 89, caracterise en ce qu'il comporte les etapes 
suivantes : 

15 a) eventuellement, isolement de TADN a partir de 1'echantillon biologique a 

analyser, ou obtention d'un ADNc a partir de l'ARN de 1' echantillon 
biologique ; 

b) amplification specifique de l'ADN de bacteries appartenant a l'espece 
Listeria monocytogenes ou a un micro-organisme associe a l'aide d'au 

20 moins une amorce selon Tune des revendications 51 a 55 ; 

c) mise en evidence des produits d'amplifi cation. 

91. Procede selon la revendication 89 ? caracterise en ce qu'il comprend les etapes 
suivantes : 

a) mise en contact d'une sonde nucleotidique selon Tune des 
25 revendications 51 a 55, avec un echantillon biologique, Pacide nucleique 

contenu dans F echantillon biologique ayant, le cas echeant, 
prealablement ete rendu accessible a 1' hybridation, dans des conditions 
permettant l'hybridation de la sonde a l'acide nucleique d'une bacterie 
appartenant a Y espece Listeria monocytogenes ou a un micro-organisme 
30 associe ; 

b) mise en evidence de l'hybride eventuellement forme entre la sonde 
nucleotidique et l'acide nucleique de 1'echantillon biologique. 
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92. Procede selon la revendication 89, caracterise en ce qu'il comprend les etapes 
suivantes : 

a) mise en contact d'une sonde nucleotidique immobilisee sur un support 
selon la revendication 53 avec un echantillon biologique, Facide 

5 nucleique de Pechantillon ayant, le cas echeant, ete prealablement rendu 

accessible a P hybridation, dans des conditions permettant 1' hybridation 
de la sonde a Facide nucleique d'une bacterie appartenant a Pespece 
Listeria monocytogenes ou a un micro-organisme associe ; 

b) mise en contact de Phybride forme entre la sonde nucleotidique 
10 immobilisee sur un support et Facide nucleique contenu dans 

P echantillon biologique, le cas echeant apres elimination de Pacide 
nucleique de Pechantillon biologique n'ayant pas hybride avec la sonde, 
avec une sonde nucleotidique marquee selon la revendication 52 ; 

c) mise en evidence du nouvel hybride forme a Petape b). 

15 93. Procede selon la revendication 92, caracterise en ce que, prealablement a Petape 
a), PADN de Pechantillon biologique ou PADNc obtenu eventuellement par 
transcription inverse de PAKN de Pechantillon, est amplifie a Paide d'au moins 
une amorce selon Pune des revendications 51 a 55. 

94. Kit ou necessaire pour la detection et/ou {'identification de bacteries 
20 appartenant a Pespece Listeria monocytogenes ou a un micro-organisme 

associe, caracterise en ce qu ? il comprend les elements suivants : 

a) une sonde nucleotidique selon Pune des revendications 51 a 55; 

b) eventuellement, les reactifs necessaires a la mise en ceuvre d'une 
reaction d' hybridation ; 

25 c) eventuellement, au moins une amorce selon Pune des revendications 51 

a 55 ainsi que les reactifs necessaires a une reaction d'amplification de 
PADN. 

95. Kit ou necessaire pour la detection et/ou Pidentification de bacteries appartenant 
a Pespece Listeria monocytogenes ou a un micro-organisme associe, caracterise 

30 en ce qu'il comprend les elements suivants : 

a) une sonde nucleotidique, dite sonde de capture, selon la 
revendication 53 ; 
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b) une sonde oligonucleotidique, dite sonde de revelation, selon la 
revendication 52; 

c) eventuellement, au moins une amorce selon Tune des 
revendications 51 a 55 ainsi que les reactifs necessaires a une 

5 reaction d'amplification de l'ADN. 

96. Kit ou necessaire pour la detection et/ou 1' identification de bacteries appartenant 
a 1' espece Listeria monocytogenes ou a un micro-organisme associe, caracterise 
en ce qu'il comprend les elements suivants : 

a) au moins une amorce selon Tune des revendications 51 a 55; 
10 b) eventuellement, les reactifs necessaires pour effectuer une reaction 

d'amplification d' ADN ; 
c) eventuellement, un composant permettant de verifier la sequence du 
fragment amplifie, plus particulierement une sonde oligonucleotidique 
selon Tune des revendications 51 a 55. 
15 97. Procede selon les revendications 81, 82 et 89 a 93 ou kit ou necessaire selon les 
revendications 83, 87, 88 et 94 a 96 pour la detection et/ou T identification de 
bacteries appartenant a 1' espece Listeria monocytogenes, caracterise en ce que 
ladite amorce et/ou ladite sonde sont choisies parmi les sequences 
nucleotidiques selon Tune des revendications 2 a 4, 8 a 26, 51 a 55 et 77 
20 specifiques de 1 'espece Listeria monocytogenes, en ce que lesdits polypeptides 

sont choisis parmi les polypeptides selon Tune des revendications 5 a 7, 27 a 44 
et 74 ou 76 specifiques de 1' espece Listeria monocytogenes et en ce que lesdits 
anticorps sont choisis parmi les anticorps selon l'une des revendications 79 et 80 
diriges contre les polypeptides choisis parmi les polypeptides selon Tune des 
25 revendications 5 a 7, 26 a 44, 74 ou 76 specifiques de F espece Listeria 
monocytogenes. 

98. Procede ou kit ou necessaire selon la revendication 97, caracterise en ce que 
ladite amorce et/ou ladite sonde sont choisies parmi les sequences 
nucleotidiques codant pour une proteine secretee, en ce que lesdits polypeptides 
30 sont choisis parmi les polypeptides secretes et en ce que lesdits anticorps sont 

choisis parmi les anticorps selon Tune des revendications 79 et 80 diriges contre 
des polypeptides secretes ou de surface de Listeria monocytogenes, lesdits 
polypeptide etant de preference SEQ ID N° 2 a SEQ ID N°41 . 
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99. Souche de Listeria monocytogenes, caracterisee en ce qu'elle contient au moins 
une mutation dans au moins une sequence nucleotidique selon Tune des 
revendications 2 a 4 ou 8 a 26. 

100. Souche de Listeria monocytogenes selon la revendication 99, caracterisee en 
5 ce qu'elle contient une mutation dans une sequence nucleotidique selon la 

revendication 13. 

101. Souche de Listeria monocytogenes selon la revendication 99, caracterisee en 
ce qu'elle contient une mutation dans une sequence nucleotidique selon la 
revendication 14. 

10 102. Souche de Listeria monocytogenes selon la revendication 99, caracterisee en 
ce qu'elle contient une mutation dans une sequence nucleotidique selon la 
revendication 9, 19 ou 20. 

103. Souche de Listeria monocytogenes selon la revendication 99, caracterisee en 
ce qu'elle contient une mutation dans une sequence nucleotidique selon la 

15 revendication 12, en particulier codant pour une proteine impliquee dans les 
mecanismes de secretion. 

104. Souche de Listeria monocytogenes selon la revendication 99, caracterisee en 
ce qu'elle contient une mutation dans une sequence nucleotidique codant pour 
une proteine impliquee dans la resistance et/ou 1'adaptation au stress. 

20 105. Souche de Listeria monocytogenes selon la revendication 99, caracterisee en 
ce qu'elle contient une mutation dans une sequence nucleotidique codant pour 
une proteine impliquee dans la biosynthese de la vitamine B12. 
106. Souche de Listeria monocytogenes selon l'une des revendications 99 a 105, 
caracterisee en ce que la mutation mene a une inactivation du gene. 

25 107. Souche de Listeria monocytogenes selon Tune des revendications 99 a 105, 
caracterisee en ce que la mutation mene a une surexpression du gene. 

108. Utilisation d'un polypeptide selon Tune des revendications 5 a 7, 27 a 44, 74 
et 76 d'une cellule transformee selon l'une des revendications 69 a 71 d'une 
souche selon la revendication 99 a 107 et/ou d'un animal selon la revendication 

30 72, pour la biosynthese ou la biodegradation d'un compose d'interet. 

109. Procede de biosynthese ou de biodegradation d'un compose d'interet, 
caracterise en ce qu'il met en oeuvre un polypeptide selon l'une des 
revendications 5 a 7, 26 a 44, 74 et 76, une cellule transformee selon l'une des 
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revendications 69 a 71 une souche selon Tune des revendications 99 a 107 et/ou 
un animal selon la revendication 72. 
110. Utilisation selon la revendication 108 ou procede selon la revendication 109, 
caracterises en ce que le compose d'interet est la vitamine B12. 
5 111. Utilisation d'une sequence nucleotidique selon Tune des revendications 2 a 
4, et, 8 a 26, d'un polypeptide selon Tune des revendications 5 a 7, 27 a 44, 74 
et 76, d'un anticorps selon Tune des revendications 79 et 80, d'une cellule 
selon Tune des revendications 69 a 71, et/ou d'un animal transforme selon la 
revendication 72 pour la selection de compose organique ou inorganique 

10 capable de moduler, de reguler, d'induire ou d'inhiber l'expression de genes, 
et/ou de modifier la replication cellulaire de cellules eucaryotes ou procaryotes 
ou capables d'induire, d'inhiber ou d'aggraver chez un organisme animal ou 
humain les pathologies liees a une infection par Listeria monocytogenes ou par 
un micro-organisme associe. 

15 112. Methode de selection de compose capable de se lier a un polypeptide selon 
1'une des revendications 5 a 7, 27 a 44, 74 et 76, capable de se lier a une 
sequence nucleotidique selon Tune des revendications 2 a 4, et, 8 a 26, ou 
capable de reconnaitre un anticorps selon Tune des revendications 79 et 80, 
et/ou capable de moduler, de reguler, d'induire ou d'inhiber l'expression de 

20 genes, et/ou de modifier la replication cellulaire de cellules eucaryotes ou 
procaryotes, ou capables d'induire, d'inhiber ou d'aggraver chez un organisme 
animal ou humain les pathologies liees a une infection par Listeria 
monocytogenes, caracterisee en ce qu'elle comprend les etapes suivantes : 

a) mise en contact dudit compose avec ledit polypeptide, ladite sequence 
25 nucleotidique, avec une cellule transformee selon 1'une des 

revendications 69 a 71, et/ou administration dudit compose a un animal 
transforme selon la revendication 72; 

b) determination de la capacite dudit compose a se lier avec ledit 
polypeptide ou ladite sequence nucleotidique, ou de moduler, de reguler, 

30 d'induire ou d'inhiber l'expression de genes, ou de moduler la 

croissance ou la replication cellulaire, ou d'induire, d'inhiber ou 
d'aggraver chez ledit organisme animal ou humain les pathologies liees 
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a une infection par Listeria monocytogenes ou par un micro-organisme 
associe. 

113. Compose susceptible d'etre selectionne par une methode selon la 
revendication 112. 

5 114. Composition pharmaceutique comprenant un compose choisi parmi les 
composes suivants : 

a) une sequence nucleotidique selon Tune des revendications 2 a 4, et 8 a 
26; 

b) un polypeptide selon Tune des revendications 5 a 7, 27 a 44, 74 et 76 ; 
10 c) un vecteur selon Tune des revendications 62 a 68, et 78 ; 

d) un anticorps selon la revendication 79 ou 80 ; et 

e) un compose selon la revendication 113. 

115. Composition selon la revendication 114, eventuellement en association avec 
un vehicule pharmaceutiquement acceptable. 
15 116. Composition pharmaceutique selon Tune des revendications 1 14 et 115 pour 
la prevention ou le traitement d'une infection par une bacterie appartenant a 
Tespece Listeria monocytogenes ou par un micro-organisme associe. 

117. Composition vaccinate, caracterisee en ce qu'elle comprend un ou plusieurs 
polypeptides selon Tune des revendications 5 a 7, 27 a 44, 74, et/ou un ou 

20 plusieurs polypeptides hybrides selon la revendication 76. 

118. Utilisation d'une cellule selon Tune des revendications 62 a 68, pour la 
preparation d'une composition vaccinale. 

1 19. Composition vaccinale, caracterisee en ce qu'elle contient un vecteur selon 
Tune des revendications 62 a 68 et 78, et/ou une cellule selon 1'une des 

25 revendications 69 a 71 . 

120. Composition vaccinale capable d'induire une reponse immunitaire cellulaire 
ou humorale pour la prevention ou Ie traitement d'une infection par bacterie 
appartenant a l'espece Listeria monocytogenes ou par un micro-organisme 
associe, caracterisee en ce qu'elle comprend une composition immunogene 

30 selon 1'une des revendications 117 et 119, en association avec un vehicule 
pharmaceutiquement acceptable et, eventuellement un ou plusieurs adjuvants de 
l'immunite appropries. 
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121. Banque d'ADN genomique d'une bacterie du genre Listeria, caracterisee en 
ce que ladite banque d'ADN est clonee dans des chromosomes artificiels de 
bacteries (BAC). 

122. Banque d'ADN genomique selon la revendication 121, caracterisee en ce 
5 que ladite bacterie est Listeria monocytogenes. 

123. Banque d'ADN genomique selon la revendication 121, caracterisee en ce 
que ladite bacterie est la souche EGD-e de Listeria monocytogenes. 

124. Banque selon la revendication 122, caracterisee en ce qu'il s'agit de la 
banque LM-baclib deposee a la CNCM le 11 Avril 2000, sous le numero 

10 d'ordre 1-2439. 

125. Methode pour 1'isolement d'un polynucleotide d'interet present chez une 
souche de Listeria et absente chez une autre souche, caracterise en ce quelle 
utilise au moins une banque selon Tune des revendications 121 ou 124. 

126. Methode selon la revendication 125, caracterisee en ce qu'elle comprend les 
1 5 etapes suivantes : 

a) isoler au moins un polynucleotide contenu dans un clone de la banque 
d'ADN basee sur un BAC, d'origine de Listeria, 

b) isoler : 

- au moins un polynucleotide genomique ou ADNc d'une Listeria, ladite 
20 Listeria appartenant a une souche differente de la souche utilisee pour la 

construction de la banque d'ADN BAC de l'etape a) ou, de fa?on 
alternative, 

au moins un polynucleotide contenu dans un clone d'une banque d'ADN 
basee sur un BAC prepare a partir du genome d'une Listeria qui est 
25 differente de la Listeria utilisee pour la construction de la banque 

d'ADN baseb sur le BAC de l'etape a). 

c) hybrider le polynucleotide de l'etape a) au polynucleotide de l'etape b) ; 

d) selectionner les polynucleotides de l'etape a) qui n'ont pas forme de 
complexe d'hybridation avec les polynucleotides de l'etape b) ; 

30 e) caracteriser le polynucleotide selectionne. 

127. Banque genomique d'une bacterie du genre Listeria. 

128. Banque genomique selon la revendication 127, caracterisee en ce que la 
bacterie est Listeria monocytogenes. 



